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HE importance of the standpoint afforded by the theory 
of groups for the discovery of the general laws of 


quantum theory has of late become more and more 
apparent. Since I have for some years been deeply concerned 
with the theory of the representation of continuous groups, it 
has seemed to me appropriate and important to give an account 
of the knowledge won by mathematicians working in this field 
in a form suitable to the requirements of quantum physics. An 
additional impetus is to be found in the fact that, from the 
purely mathematical standpoint, it is no longer justifiable to 
draw such sharp distinctions between finite and continuous 
groups in discussing the theory of their representations as has 
been done in the existing texts on the subject. My desire to 
show how the concepts arising in the theory of groups find their 
application in physics by discussing certain of the more important 
examples has necessitated the inclusion of a short account of the 
foundations of quantum physics, for at the time the manuscript 
was written there existed no treatment of the subject to which 
I could refer the reader. In brief this book, if it fulfills its 
purpose, should enable the reader to learn the essentials of the 
theory of groups and of quantum mechanics as well as the rela- 
tionships existing between these two subjects ; the mathematical 
portions have been written with the physicist in mind, and vice 
versa. I have particularly emphasized the “ reciprocity’ be- 
tween the representations of the symmetric permutation group 
and those of the complete linear group; this reciprocity has as 
yet been unduly neglected in the physical literature, in spite of 
the fact that it follows most naturally from the conceptual 
structure of quantum mechanics, 
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There exists, in my opinion, a plainly discernible parallelis 
between the more recent developments of mathematics an 
physics. Occidental mathematics has in past centuries broke 
away from the Greek view and followed a course which seen 
to have originated in India and which has been transmittec 
with additions, to us by the Arabs; in it the concept of numbe 
appears as logically prior to the concepts of geometry. Th 
result of this has been that we have applied this systematicall 
developed number concept to all branches, irrespective of whethe 
it is most appropriate for these particular applications. Bu 
the present trend in mathematics is clearly in the direction of ; 
return to the Greek standpoint; we now look upon each branct 
of mathematics as determining its own characteristic domair 
of quantities. The algebraist of the present day considers the 
continuum of real or complex numbers as merely one “ field ” 
among many; the recent axiomatic foundation of projective 
geometry may be considered as the geometric counterpart of 
this view. This newer mathematics, including the modern 
theory of groups and “abstract algebra,” is clearly motivated 
by a spirit different from that of “ classical mathematics,” which 
found its highest expression in the theory of functions of a 
complex variable. The continuum of real numbers has retained 
its ancient prerogative in physics for the expression of physical 
measurements, but it can justly be maintained that the essence 
of the new Heisenberg-Schrédinger-Dirac quantum mechanics is 
to be found in the fact that there is associated with each physical 
system a set of quantities, constituting a non-commutative 
algebra in the technical mathematical sense, the elements of 
which are the physical quantities themselves. 


ZURICH, August, 1928 
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in mathematical physics in Princeton University. The 

lectures which I gave there and in other American insti- 
tutions afforded me a much desired opportunity to present anew, 
and from an improved pedagogical standpoint, the connection 
between groups and quanta. The experience thus obtained has 
found its expression in this new edition, in which the subject 
has been treated from a more thoroughly elementary standpoint. 
Transcendental methods, which are in group theory based on 
the calculus of group characteristics, have the advantage of 
offering a rapid view of the subject as a whole, but true under- 
standing of the relationships is to be obtained only by following 
an explicit elementary development. I may mention in this 
connection the derivation of the Clebsch-Gordan series, which is 
of fundamental importance for the whole of spectroscopy and 
for the applications of quantum theory to chemistry, the section 
on the Fordan-Hélder theorem and its analogues, and above all 
the careful investigation of the connection between the algebra 
of symmetric transformations and the symmetric permutation 
group. The reciprocity laws expressing this connection, which 
were proved by transcendental methods in the first edition, as well 
as the group-theoretic problem arising from the existence of spin 
have also been treated from the elementary standpoint. Indeed, 
the whole of Chapter V—which was, in the opinion of many 
readers, much too condensed and more difficult to understand 
than the rest of the book—has been entirely re-written, The 
algebraic standpoint has been emphasized, in harmony with the 
recent development of ‘‘ abstract algebra,” which has proved so 
useful in simplifying and unifying general concepts. It seemed 
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impossible to avoid presenting the principal part of the theory 
of representations twice ; first in Chapter III, where the repre- 
sentations are taken as given and their properties examined, 
and again in Chapter V, where the method of constructing the 
representations of a given group and of deducing their properties 
is developed. But I believe the reader will find this two-fold 
treatment an advantage rather than a hindrance. 

To come to the changes in the more physical portions, in 
Chapter IV the rôle of the group of virtual rotations of space 
is more clearly presented. But above all several sections have 
been added which deal with the energy-momentum theorem of 
quantum physics and with the quantization of the wave equation 
in accordance with the recent work of Heisenberg and Pauli. 
This extension already leads so far away from the fundamental 
purpose of the book that I felt forced to omit the formulation 
of the quantum laws in accordance with the general theory of 
relativity, as developed by V. Fock and myself, in spite of its 
desirability for the deduction of the energy-momentum tensor. 
The fundamental problem of the proton and the electron has 
been discussed in its relation to the symmetry properties of the 
quantum laws with respect to the interchange of right and left, 
past and future, and positive and negative electricity. At 
present no solution of the problem seems in sight; I fear that 
the clouds hanging over this part of the subject will roll together 
to form a new crisis in quantum physics. I have intentionally 
presented the more difficult portions of these problems of spin 
and second quantization in considerable detail, as they have 
been for the most part either entirely ignored or but hastily 
indicated in the large number of texts which have now appeared 
on quantum mechanics. 

It has been rumoured that the ‘‘ group pest” is gradually 
being cut out of quantum physics. This is certainly not true 
in so far as the rotation and Lorentz groups are concerned ; 
as for the permutation group, it does indeed seem possible to 
avoid it with the aid of the Pauli exclusion principle. Never- 
theless the theory must retain the representations of the per- 
mutation group as a natural tool in obtaining an understanding 
of the relationships due to the introduction of spin, so long as 
its specific dynamic effect iseneglected. I have here followed the 
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trend of the times, as far as justifiable, in presenting the group- 
theoretic portions in as elementary a form as possible. The 
calculations of perturbation theory are widely separated from 
these gencral considerations; I have therefore restricted myself 
to indicating the method of attack without either going into 
details or mentioning the many applications which have been 
based on the ingenious papers of Hartree, Slater, Dirac and 
others. 

The constants c and h, the velocity of light and the quantum 
of action, have caused some trouble. The insight into the 
significance of these constants, obtained by the theory of rela- 
tivity on the one hand and quantum theory on the other, is 
most forcibly expressed by the fact that they do not occur in 
the laws of Nature in a thoroughly systematic development of 
these theories. But physicists prefer to retain the usual c.g.s. 
units—principally because they are of the order of magnitude of 
the physical quantities with which we deal in everyday life.. 
Only a wavering compromise is possible between these practical 
considerations and the ideal of the systematic theorist; I 
initially adopt, with some regret, the current physical usage, 
but in the course of Chapter IV the theorist gains the upper 
hand. 

An attempt has been made to increase the clarity of the 
exposition by numbering the formule in accordance with the 
sections to which they belong, by emphasizing the more im- 
portant concepts by the use of boldface type on introducing 
them, and by lists of operational symbols and of letters having 
a fixed significance. 


H. WEYL. 


GÖTTINGEN, November, 1930 
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Te translation was first planned, and in part completed, 
during the academic year 1928-29, when the translator 
was acting as assistant to Professor Weyl in Princeton. 
Unforeseen delays prevented the completion of the manuscript 
at that time, and as Professor Weyl decided shortly afterward 
to undertake the revision outlined in the preface above it seemed 
desirable to follow the revised edition. In the preparation of 
this manuscript the German has been followed as closely as 
possible, in the conviction that any alterations would but de- 
tract from the elegant and logical treatment which characterizes 
Professor Weyl’s works. While an attempt has been made 
to follow the more usual English terminology in general, this 
programme is limited by the fact that the fusion of branches of 
knowledge which have in the past been so widely separated as 
the theory of groups and quantum theory can be accomplished 
only by adapting the existing terminology of each to that of 
the other; a minor difficulty of a similar nature is to be found 
in the fact that the development of “ fields ” and “ algebras ” 
in Chapter V is accomplished in a manner which makes it appear 
desirable to deviate from the accepted English terminology. 

It is a pleasure to express my indebtedness to Professor Weyl 
for general encouragement and assistance, to Professor R. E. 
Winger of Union College for the assistance he has rendered in 
correcting proof and in preparing the index, and to the publishers 
for their coöperation in adhering as closely as possible to the 
original typography. 

H. P. ROBERTSON 


PRINCETON, September, 931 
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INTRODUCTION 
[os quantum theory of atomic processes was proposed by 


Nieks Bour in the year 1913, and was based on the 
atomic model proposed earlier by RUTHERFORD. The 
deduction of the Balmer series for the line spectrum of hydrogen 
and of the Rydberg number from universal atomic constants 
constituted its first convincing confirmation. This theory gave 
us the key to the understanding of the regularities observed in 
optical and X-ray spectra, and led to a deeper insight into the 
structure of the periodic system of chemical elements. The issue 
of Naturwissenschaften, dedicated to Bour and entitled ‘ Die 
ersten zehn Jahre der Theorie von Ners Bohr über den Bau 
der Atome ” (Vol. 11, p. 535 (1923)), gives a short account of the 
successes of the theory at its peak. But about this time it began 
to become more and more apparent that the Bour theory was 
a compromise between the old “‘classical’’ physics and a new 
quantum physics which has been in the process of development 
since Planck’s introduction of energy quanta in 1900. BOHR 
described the situation in an address on ‘‘Atomic Theory and 
Mechanics ” (appearing in Nature, 116, p. 845 (1925)) in the 
words: “From these results it seems to follow that, in the 
general problem of the quantum theory, one is faced not with 
a modification of the mechanical and electrodynamical theories 
describable in terms of the usual physical concepts, but with 
an essential failure of the pictures in space and time on which 
the description of natural phenomena has hitherto been based.” 
The rupture which led to a new stage of the theory was made 
by HrIsENBERG, who replaced Bohr’s negative prophecy by a 
positive guiding principle. 
The foundations of the new quantum physics, or at least 
its more important theoretical aspects, are to be treated in this 
XIX 
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book. For supplementary references on the physical side, 
which are urgently required, I name above all the fourth edition 
of SOMMERFELD’s well-known “Atombau und Spektrallinien ” 
(Braunschweig, 1924), or the English translation “Atomic 
Structure and Spectral Lines’’ (London, 1923) of the third 
edition, together with the recent (1929) ‘‘ Wellenmechanischer 
Erganzungsband ” or its English translation ‘‘ Wave Mechanics ” 
(1930). An equivalent original English book is that of Ruarx 
AND Urey, “Atoms, Molecules and Quanta ” (New York, 1930), 
which appears in the “ International Series in Physics,” edited 
by Ricutmever. I should also recommend GerrLacH’s short 
but valuable survey ‘‘ Experimentelle Grundlagen der Quanten- 
theorie” (Braunschweig, 1921). The spectroscopic data, pre- 
sented in accordance with the new quantum theory, together 
with complete references to the literature, are given in the 
following three volumes of the series “Struktur der Materie,” 
edited by Born AND FRANCK :— 

F. Hunp, ‘‘Linienspektren und periodisches System der 
Elemente’’ (1927) ; 

E. Back anD A. Lanpg, “ Zeemaneffekt und Multiplett- 
struktur der Spektrallinien’’ (1925) ; 

W. Grotrian, “Graphische Darstellung der Spektren von 
Atomen und Ionen mit ein, zwei und drei Valenzelektronen”’ 
(1928), ; 

The spectroscopic aspects of the subject are also discussed 
in PauLinc anD Goupsmit's recent “The Structure of Line 
Spectra” (1930), which also appears in the “ International 
Series in Physics.” 

The development of quantum theory has only been made 
possible by the enormous refinement of experimental technique, 
which has given us an almost direct insight into atomic 
processes. If in the following little is said concerning the 
experimental facts, it should not be attributed to the mathe- 
matical haughtiness of the author; to report on these things 
lies outside his field. Allow me to express now, once and for 
all, my deep respect for the work of the experimenter and for 
his fight to wring significant facts from an inflexible Nature, 


who says so distinctly ‘““No’’ and. so sa “Yes” to 
our theories, 
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Our generation is witness to a development of physical 
knowledge such as has not been seen since the days of KEPLER, 
GALILEO AND Newton, and mathematics has scarcely ever 
experienced such a stormy epoch. Mathematical thought 
removes the spirit from its wor'dly haunts to solitude and 
renounces the unveiling of the secrets of Nature. But as 
recompense, mathematics is less bound to the course of worldly 
events than physics. While the quantum theory can be traced 
back only as far as 1900, the origin of the theory of groups 
is. lost in a past scarcely accessible to history; the earliest 
works of art show that the symmetry groups of plane figures 
were even then already known, although the theory of these 
was only given definite form in the latter part of the eighteenth 
and in the nineteenth centuries. F. Krern considered the 
group concept as most characteristic of nineteenth century 
mathematics. Until the present, its most important application 
to natural science lay in the description of the symmetry of — 
crystals, but it has recently been recognized that group theory 
is of fundamental importance for quantum physics; it here 
reveals the essential features which are not contingent on a 
special form of the dynamical laws nor on special assumptions 
concerning the forces involved. We may well expect that it is 
just this part of quantum physics which is most certain of a 
lasting place. Two groups, the group of rotations in 3-dimen- 
stonal Space and the permutation group, play here the principal 
rôle, for the laws governing the possible electronic configurations 
grouped about the stationary nucleus of an atom or an ion are 
spherically symmetric with respect to the nucleus, and since the 
various electrons of which the atom or ion is composed are 
identical, these possible configurations are invariant under a 
permutation of the individual electrons. The investigation of 
groups first becomes a connected and complete theory in the 
theory of the representation of groups by linear transformations, 
and it is exactly this mathematically most important part 
which is necessary for an adequate description of the quantum 
mechanical relations. All quantum numbers, with the exception 
of the so-called principal quantum number, are indices character- 
izing representations of groups. 
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This book, which is to set forth the connection between groups 
and quanta, consists of five chapters. The first of these is 
concerned with unitary geometry. It is somewhat distressing 
that the theory of linear algebras must again and again be 
developed from the beginning, for the fundamental concepts 
of this branch of mathematics crop up everywhere in mathe- 
matics and physics, anda knowledge of them should be as 
widely disseminated as the elements of differential calculus. 
In this chapter many details will be introduced with an eye 
to future use in the applications; it is to be hoped that in 
spite of this the simple thread of the argument has remained 
plainly visible. Chapter II is devoted to preparation on the 
physical side; only that has been given which seemed to me 
indispensable for an understanding of the meaning and methods 
of quantum theory. A multitude of physical phenomena, which 
have already been dealt with by quantum theory, have been 
omitted. Chapter III develops the elementary portions of the 
theory of representations of groups and Chapter IV applies them 
_ to quantum physics. Thus mathematics and physics alternate 
in the first four chapters, but in Chapter V the two are fused 
together, showing how completely the mathematical theory is 
adapted to the requirements of quantum physics. In this last 
chapter the permutation group and its representations, togethcr 
with the groups of linear transformations in an affine or unitary 
space of an arbitary number of dimensions, will be subjected to 
a thorough going study. 
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CHAPTER I 
UNITARY GEOMETRY 
§1. The n-dimensional Vector Space 


HE mathematical field of operation of quantum mechanics, 
as well as of the theory of the representations of groups, | 
is the multi-dimensional affine or unitary space. The 
axiomatic method of developing the geometry of such a space 
is no doubt the most appropriate, but for the sake of clearness 
I shall at first proceed along purely algebraic lines. I begin 
with the explanation that a vector x in the n-dimensional 
linear space R = R, is a set of n ordered numbers (X1, £a * * *, Xa); 
vector analysis is the calculus of such ordered sets. The two 
fundamental operations of the vector calculus are the multiplica- 
tion of a vector x by a number a and the addition of two vectors y 
and. On introducing the notation 


t= (Xy, T Xn), )= (Yi Ya ° °°, Vn) 


these operations are defined by the equations 


ay = (ax, AXo, CON AXn) X + D = (x1 + Yis Xa + Y2 C., 
tn F Yn) 


The fundamental rules governing these operations of multiplica- 
tion by a number and addition are given in the following table 
of axioms, in which small German letters denote arbitrary 
vectors and small Latin letters arbitrary numbers : 


(a) Addition. 


l. a + b = b + a (commutative law). 

2. (a -+ b) + c= a + (b + ¢) (associative law). 

3. a and c being any two vectors, there exists one and only one 
vector y for which a + g =c. It is called the difference c — a of 
c and a (possibility of subtraction). 
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1. (a + bje = (ag) + (br) (first distributive law). 
2. es = (ab)z (associative law). 
i; ] 


t= 
alg + 4) = (at) + (ah) (second distributive law). 
The existence of a vector 0 = (0, 0, » + +, 0) with the property 
rto=0+r=2 


need not be postulated separately as it follows from the axioms. 

Affine vector geometry concerns itself entirely with concepts 
which are defined in terms of the two fundamental operations 
with which the axioms (x) and (8) are concerned; we mention 
a few of the most important. A number of vectors a,, Qz, ° °°, Qa 
are said to be linearly independent if there exists between them 
no homogeneous linear relation 


C40, F CoM. + . * + 6,0, = 0 
except the trivial one with coefficients 


& = 0, c= 0, ee ¢, = 0. 


h such vectors are said to span an h-dimensional (linear) sub= 
space Ñ consisting of all vectors of the form 


t= Et + Se +: °° + Eran 1.1) 


where the €’s are arbitrary numbers. It follows from the 
fundamental theorem on homogeneous linear equations that 
there exists a non-trivial homogeneous relation between any 
h +- 1 vectors of R’. The dimensionality h of R’ can therefore 
be characterized independently of the basis : every h + 1 vectors 
in R are linearly dependent, but there exist in it A linearly 
independent vectors. Any such system of h independent 
vectors 0, Mp, ° * *, G, in R can be used as a co-ordinate system 
or basis in R’ ; the coefficients é, a, + + *, éa in the representation 
(1.1) are then said to be the components of x in the co-ordinate 
system (üz, Gg, * * *, Aa). 
The entire space R is n-dimensional, and the vectors 
êi = (1, 0, 0, me 0), 


e = (0,1, 0,-> *, 0), (1.2) 


en = (0, 0,0,* + 1) 
define a co-ordinate system in it in which the components of a 
vector 

p= (41, fate’, £a) 
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agree with the “ absolute components ”’ +; : 
T = yey F tala H’ e t+ Ten. 


From the standpoint of affine geometry, however, the ‘‘ absolute 
co-ordinate system ” (1.2) has no’preference over any other which 
consists of n independent vectors of R. We now add to the 
previous axioms, which did not concern themselves with the 
dimensionality n, the following dimensionality axiom : 

(y) The maximum number of linearly independent vectors in R 
1s N. 

These axioms (a), (8), and (y) suffice for a complete formula- 
tion of vector calculus, for if €,, €s, © ° +, €n are any n independent 
vectors and y is any other vector there must necessarily exist 
a linear dependence 


AL F alr F Age, ++ * + + anen = 0 


between them. Since not all the coefficients may vanish we 
must in particular have a + 0, and consequently any vector r 
can be expressed as a linear combination 


X = Xili F Xas fe nln (1.3) 


of the “fundamental vectors ’’ @, êa © °°, ên. We specify x by 
the set (X4, Xa, © * *, Xn) of components in this co-ordinate system. 
In accordance with axioms (æ) and (£) for addition and multi- 
plication we then have for any two vectors (1.3) and y 


AL= (axe t + + (aXn)èn, LAY (2HY) at °° + (tnt Yn En, 


and we arrive at the definitions from which we started. The 
only—but important—difference between the arithmetic and 
the axiomatic treatment is that in the former the absolute co- 
ordinate system (1.2) is given the preference over any other, 
whereas in the latter treatment no such distinction is made. 

Given any system of vectors, all vectors g which are obtained, 

s (1.1), by linear combinations of a finite number of vectors 
Qj, Qe, © * *, Aa of the system constitute a (linear) sub-space—the 
sub-space ‘‘ spanned '’ by the vectors a. 

R is said to be decomposed or reduced into two linear sub- 
spaces ft’, R” (R= R + R”) if an arbitrary vector x can be 
expressed uniquely as the sum of a vector ry’ of R’ and a vector 
r” of R”. A co-ordinate system in Rt’ and a co-ordinate system 
in R” constitute together a co-ordinate system for the entire 
space R; this co-ordinate system in R is ‘‘ adapted” to the 
decomposition R' + R”. The sum n’ + n” of the dimension- 
alities of W and R” is equal to n, the dimensionality of R. 


— 
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. f ’ R” have no vector except 0 
ersely, if the sub-spaces R’, R” i A P 
POOE and if the sum of their dimensionalities is » then 
R —_ R + R”, l l 
R being an n-dimensional sub-space, two vectors g and ġ are 
said to be congruent modulo R : 


z = b (mod. R’), 


if their difference lies in ft’. Congruence satisfies the axioms 
postulated of any relation of equality: every vector is congruent 
to itself; if g = (mod. R) then ġ =g (mod. R’); if g =p 
(mod. R’) and ) =%(mod. R’), then g = 4 (mod. R’). It is 
therefore permissible to consider vectors which are congruent 
mod. R’ as differing in no wise from one another ; by this ab- 
straction, which we call projection with respect to R’, the 
n-dimensional space M gives rise to an (n — n’)-dimensijonal 
space R. is also a vector space, for from 


X = fa Yı = Ya (mod. R’) 
follow the relations 


a, = afa % +41 = ba + Ye (mod. R’). 


The operations of multiplication by a number and addition can 
therefore be considered ones which operate directly on the 
vectors x of È. All vectors g of R which are congruent mod. R 
give rise to the same vector ¢ of R. If R’ is one-dimensional 
and is spanned by e the above process is the familiar one of 
parallel projection in the direction of e: it is not necessary to 
give an (n — I)-dimensional sub-space of R om to which the 
projection is made. 

If a is a non-null vector, all vectors ¢ which arise by multi- 
plying a by a number are said to lie on the same ray asa. Two 
non-null vectors determine the same ray when, and only when, 
one is a multiple of the other. In a given co-ordinate system 
the vector a is characterized by its components ai, Ay ° ° *, ay 
whereas the ray a is characterized by their ratios a: a@g:- + +: a,; 
these ratios have meaning only when the components of a do 
not all vanish, i.e. only when a = 0. 

The transition from one co-ordinate system €; to another e,' is 
accomplished by expressing the new co-ordinate vectors e; in 
terms of the old: 


s n 
, 
Cy = lik êi 
i=l 


LINEAR CORRESPONDENCES 5 


If x;, x are the components of an arbitrary vector y in the old 
and in the new co-ordinate systems, respectively, then 


g = Lik; ê; = Žir Èk, 
t 
from which the law of transformation 


n 
Xi = Ati Xe (1.4) 
follows. The requirement that the co-ordinate vectors e,’ also 
be linearly independent is expressed arithmetically by the non- 
vanishing of the determinant of the coefficients a;» The com- 
ponents of vectors y, ),- + > in R undergo the same transformation 
on transition to the new co-ordinate system e,’ and are said to 
transform cogredienily. 


§2. Linear Correspondences. Matrix Calculus 


The formula (1.4) can, however, be otherwise interpreted ; 
it is the expression of a linear or affine correspondence or 
mapping of the space R on itself. But for this purpose it 
will be found more convenient to interchange the rôles of the 
accented and the unaccented co-ordinates. On employing a 
definite co-ordinate system €, the equation 


n 
Xr = X lir Hy (2.1) 
k=] 


associates with an arbitrary vector g with components x; a vector 
t’ with components x,’._ This correspondence A : >t’ of R on 
itself can be characterized as linear by the two assertions: if 
t, 0 go over into r’, 9’, then az goes over into ar’ and g + Ņ into 
t’ +y. Linear correspondences therefore leave all affine rela- 
tions unaltered; hence their prominence in the theory of affine 
geometry. In order to show that these two conditions fully 
determine the linear correspondence (2.1), consider the following : 
if a correspondence A which satisfies these conditions sends the 
fundamental vector è, over into 


Ce = Alinei (2.2) 
(7 


then, in consequence of the above requirements, 
T = Xie H’ ee + nln 
goes over into 
E T tet Ht E P N 
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On substituting (2.2) in this equation we see that the new vector 
y has in the co-ordinate system €; the components x,’ obtained 
from the components x, of p by means of (2.1). It has become 
customary in quantum physics to call the linear correspondences 
of a vector space R operators which operate on the arbitrary 
vector g of Ri. 

Let A, B be two linear correspondences, the first of which 
sends the arbitrary vector y over into g = Ar, while the second 
sends y’ into g” = Br’ = B(At). The resultant correspondence 
C, which carries ¢ directly into g”, is also linear and is denoted 
by (BA) (to be read from right to left!) : 


(BA)y = B(Ay). 


This “ multiplication "’ satisfies laws which are similar to those 
of multiplication of ordinary numbers; in particular, the as- 


sociative law 
C(BA) = (CB)A 


is here valid, but the commutative law is not—in general 
AB + BA. The* 1” in this domain, which we here denote by 
1, is the identity, i.e. that correspondence which associates every 
vector t with itself: g — xy. Hence 


A1=1A= A. 


The correspondence A is then and only then reversible in case 
it is non-degenerate, i.e. if it carries no non-vanishing vector into 
the vector 0, or if distinct vectors are always carried over into 
distinct ones. The algebraic condition for this is the non- 
vanishing of the determinant CAA = det 4; there then exists 
the inverse correspondence 47! : 


AAT! = AA = 1. 
The multiplication theorem for determinants states that 
det (BA) = det B » det A. 


t 


Not only can we “ multiply ” two correspondences, we can 
also “add” them. This concept of addition arises quite natur- 
ally: if the arbitrary vector x is sent over into x,’ by A and into 
ta by B, then that correspondence which sends g into g; + fy is 
also linear and is denoted by A + B.: 


(A + B)g = Ag + Br. 
We may also introduce multiplication by an arbitrary number 
a: aA is that correspondence which sends y into a( Ag). Addition 
and multiplication by a number obey the same laws as the 
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analogous operations on vectors. Addition is commutative, 
and has as its inverse subtraction. The rôle of 0 is played by 
the correspondence 0 which transforms every vector r into the 
vector 0. Addition obeys the distributive law with respect to 
multiplication : 


(A+ BNC = AC + BC, | C(A + B) = CA + CB, 
(aA)C = a{ AC), C(ad) = a(CA). 

Before proceeding to the arithmetical expression of these 
operations in a given co-ordinate system, we consider another 
natural generalization. We can map an m-dimensional vector 
space KR linearly on an n-dimensional space ©; this is accom- 
plished when with each vector g of R a vector t) of © is associated 
in such a way y > y that from xy, > Yı, Ta > Ye it follows that 


Ay > ayy, Ti F E —> + Ye 


Such a correspondence A:f->¥ is expressed by equations of 
the form 


m 
Yy = Ate Xi (k = 1, 2, oy n n) (2.3) 


where x, * * *, £m are the components of p in a given co-ordinate 
system in the space R and yı, + * +, Yn have the corresponding 
interpretation in ©. With this correspondence A there is 
associated the matrix 


Arr Ayn ss s lim 
Qe, Ago. . - dom 


Any Ang » + + Anm 
with n rows and m columns, and which we also denote by 
the same letter A. The first index indicates the row and the 
second the column to which a,, belongs. We can also add corre- 
spondences of the same space N on the same space ©. Addition 
and multiplication by a number is accomplished on matrices by 
subjecting their n -m components to these operations: if 
A = {ay || and B= || bx || 
then 
aA = || @* api Il; A+B= || lri + bxi ||. 
If we have a third (p-dimensional) vector space Ẹ, the consec- 
utive application of the correspondences A :g — y of R on © and 
B: -3 of G on gives rise to the correspondence C = BA: t > 4 
of R on T. This composition is expressed in terms of matrix 
components by the law 


ete b= 1.248 3p 
Cii = Ones € See a (2.4) 
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B has p rows and n columns and A n rows and m columns; the 
composition of matrices is possible when the first factor B has 
the same number of columns as the second factor A has rows. 
The component or element cı, which is found at the intersection 
of the l row and the i® column, is formed in accordance with 
(2.4) from the components in the / row of B and the i” column 
of A. An important special case is that in which Ẹ is the same 
space as Ñ; A is then a correspondence of R on ©, B of © on R. 
Already here concepts of the theory of groups play an important 


rôle; on beginning Chapter III, which deals with the theory of 
groups, the reader should return to the matter here discussed 
as an illustration. 

The matrix calculus allows us to express the formule for 
a linear correspondence, such as (2.3), in an abbreviated form. 
We do this by denoting by x that matrix whose only column 
consists of the vector components %, %a ** *, Xm; similarly 
for y. In accordance with the rule (2.4) for the composition of 
matrices, equations (2.3) can be written 


y = Ax. (2.5) 


_. pena em CRE ITSO OOS ET - 
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This form is particularly useful in examining the effect on the 
matrix A of a linear correspondence of a space R on a space © 
when the original co-ordinate systems are replaced by new ones. 
If this change of co-ordinates is effected by the transformations 


xi = Vsij0; or x= Sx in K, 
j 
Ye = Zia Yn or y= Ty’ in ©, 
then from (2.5) 
Ty = ASx’ or y = (TAS). 


Fic. 2. 


The same correspondence in the new co-ordinates is therefore 
expressed by the matrix 


A’ = TIAS. (2.6) 


Let us now return to the linear correspondence A of a space 
R on to itself. If R’ is a linear m’-dimensional sub-space of R 
we say that A leaves invariant if it carries any vector of W, 
over into a vector of R’. If the co-ordinate system is so chosen 
that the first n’ fundamental vectors lie in R, the matrix of 
a correspondence which leaves R’ invariant will assume the 
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form given by Fig. 1. All elements in the rectangle of n’ columns 
and n — n’ rows denoted by zeros in Fig. 1, vanish. 4 contains 
a correspondence of ft’ on to itself and at the same time a corre- 
spondence of the space $, arising by projecting R with respect 
to R’, on to itself. The matrices of these correspondences con- 
sist in the shaded squares. If R is decomposed into R, + Ra 
(nı + ng = n), and if the correspondence A leaves both sub- 
spaces R, and R, invariant, then 4 is completely reduced 
into a correspondence of R, on itself and a correspondence of 
Ra on to itself. If the co-ordinate system is adapted to the 
decomposition R, + R., the matrix A is completely reduced into 
two square matrices arranged along the principal diagonal as 
in Fig. 2. The unshaded rectangles are empty—the elements 
situated in these portions are all zero. 

Let the n-dimensional linear space be decomposed into 
sub-spaces R, + R; -+> + +, Ra having the dimensionality na: n is 
then equal to the sum n4 + na + * °". Any vector g can then be 
written uniquely as the sum of components g + £e + - * - which 
lie in the sub-spaces Rı, Ra, > +» The association g > zy is 
a linear correspondence Ea of R on to Re. Given a correspond- 
ence 4:¢— yr of R on to itself, we consider that linear corre- 
spondence [A]ag which carries an arbitrary vector ¢ of Rg over 
into the component fa’ in Ra of g’. We call [Alag the portion of 
A in which Ra intersects Re. This terminology arises from the 
matrix representation of A; on adapting the co-ordinate system 
to the decomposition R, + R, +» » » the set of variables x, or 
rather their indices i which number the rows and columns of 
the matrix, is broken up into segments of lengths na (a = 1, 2, ++). 
The matrix A is thereby divided into the single rectangles 
[A] ap in which the aè set of rows intersects the 6" set of columns, 
and which consist of na * ng elements. 

If A is the matrix of a correspondence of R on to itself in 
a given co-ordinate system, and A’ its matrix in a co-ordinate 
system obtained from the first by means of the reversible 
transformation S, then in accordance with (2.6) 


A! = S-1AS. (2.7) 


The search for an invariantive characterization of correspondences 
may be formulated algebraically: to find expressions which 
are so formed from the components of an arbitrary matrix that 
they assume the same value for equivalent matrices, ie. for 
matrices A, A’ between which a relation (2.7) exists. The way 
in which this can be accomplished is indicated by the related 
problem of finding a vector x +0 which is transformed into 
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a multiple Ar of itself under the influence of A. The column v 
of the components of g must then satisfy the equation 
àx = Ax, or (AL— A)x = 0. 


But n linear homogeneous equations in n unknowns have a 
non-vanishing solution only if their determinant vanishes; the 
multiplier A is therefore necessarily a root of the ‘‘ characteristic 
polynomial” 

f(A) = det (Al — A) (2.8) 
of A. This polynomial is an invariant in the above sense, for 
from (2.7) or SA’ = AS it follows that 


S(AL — A’) = (Al — A)S, 
whence by the theorem concerning the multiplication of deter- 
minants 
det S- det (AL — A’) = det (Al — A) - det S. 


Since the determinant of the reversible transformation S cannot 
vanish, we can divide by it and obtain the required identity 


[AL — A’| = [AL — Al. 
The characteristic polynomial is of degree n in À: 
f(a) m- An m SA"! -+ o.. + Sn 


whose coefficients, certain integral functions of the elements 
Gy, are invariants of the correspondence A. The “norm” S, 
is merely the determinant of A. The first coefficient s,, the 


frace 
Sy = lu F a bt F an = trA (2.9) 
is of more importance, as it depends linearly on the a,, : 
tr(A, + A,) = trA, + tráz 


If A is a linear correspondence of the m-dimensional vector 
space R on the n-dimensional space ©, and B is conversely a 
linear correspondence of © on R, then we can build the corre- 
spondences BA of R on to itself and AB of © onto itself. These 
two correspondences have the same trace 


tr(BA) = tr(AB) (2.10) 


for, in accordance with the rule of composition (2.4) and the 
definition (2.9) we have 


tr(BA) = Lbir api tr(AB) = Lari bir 
ik ik 
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where 7 runs from 1 to m and k from 1 to n. The special 
in which A and B are both correspondences of R on to i 
naturally deserves particular consideration. 


§8. The Dual Vector Space 
A function L(g) of the arbitrary vector g of the form 


OX F Qata ti * + EnEn ( 


is called a linear form. This concept is invariant in the sens 
affine geometry: it can be defined by means of the functic 
properties 

Let) = aL), Le +9) = Lie) + Lid). 


It is obvious that the expression (3.1) has these properties, 
conversely, on introducing a co-ordinate system ê; and set 
t = 2'x,e,;, it follows that 


L(t) = DEF L(e;) = ous Xi; QS L(e,). 


On going over to another co-ordinate system such that 
components x, of an arbitrary vector y undergo the transfor 
tion (1.4), the linear form becomes 


D hy; = D/H k 


the coefficients a,’ of which are related to the original a; by - 
equations 


1 
k = 2 Aix ” Ky. 
3 


The coefficients a, of a linear form are said io transform cont 
grediently to the variables x, 

It is, however, not necessary to consider the «; as consta 
and the x, as variables. When the a, do not all vanish the eq! 
tion L(g) = 0 defines a “plane,” i.e. an (n — 1)-dimensio: 
sub-space; a vector x lies in the plane if its components sati 
this equation. But onthe other hand we can ask for the equat: 
of all planes which pass through a given non-vanishing vector ; 
the x; = x; are then constants and the «, variables. It is the 
fore most appropriate to consider the two sets (xi, Xa ° °°, % 
(1, Xe, ° * °y On) in parallel. 

We therefore introduce in addition to the space R a seco 
n-dimensional vector space, the dual space P. From the co. 
ponents (én, Ea *> °°, Én) of a vector E of P and a vect 
(Xi, Xa, * * *, Xn) of Jt we can construct the inner or scalar prod: 


Eata + Sexe tse t+ En tn (3 
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This product has, by definition, an invariantive significance, for 
when & is referred to a new co-ordinate system by means of 
a transformation of the x, the variables é; of the dual space P 
undergo the contragredient transformation. This dual space is 
in fact introduced in order to enable us to associate a contra- 
gredient transformation with each one-to-one transformation. 
To repeat, two linear reversible transformations 


x= Ax’, E= A’ (3.3) 


are contragredient with respect to each other if they leave (3.2) 
unaltered : 


Erki + Sexe ts + H Én Xn = Sa ak TE? te i a PEPE (3.4) 


A vector g of R and a vector £ of P are said to be in involution 
when their product (3.2) vanishes. A ray in R determines a 
plane in P, i.e. the plane consisting of the vectors which are in 
involution with the given ray, and conversely. Duality is 
a reciprocal relationship.f 

The dual or transposed matrix A* of a matrix A = |la,,|| 
is obtained by interchanging the rows and columns of A. 
A* = |laj|| is therefore defined by af, = ap and has m rows 
and n columns. We shall always employ the asterisk to in- 
dicate this process. And what is its geometrical interpretation ? 
Let R be an m-dimensional, © an n-dimensional, vector space ; 
A:%— t a linear correspondence of R on ©, specified in terms 
of given co-ordinate systems in R and © by the matrix 4: 


Ve = Lani Xi, 
and let P, 2’ be the dual spaces. The product 
21 Ve = 2 Ari Ne Hal = A Ei xi), 
yt i 


where 7 is an arbitrary vector of & with components n, has then 
an invariantive significance. A bilinear form which depends 
linearly on a vector y of 2 and a vector ț of R is therefore in- 
variantively associated with a linear correspondence of R on ©, 
and conversely. This gives rise, as the expression of the bi- 
linear form given in parentheses shows, to a correspondence 


n> €: Ei = Ze Me 


of & on P, i.e. the dual A* of A. The reciprocal relation existing 
between the correspondence A and its dual A* may be expressed 


t In the theory of relativity it is usual to call vectors in R and P contra- 
variant and covariant vectors, respectively. 


Aa ete at 
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as follows: if ¢ is an arbitrary vector in R and y is an arbitrary 
vector in 2, then the product of the vectors Ay and 7 is equal 
to the product of g and A*y. The dual correspondences obey 
the linear laws 


(Ay + Ae)* = A" + A,*, (@A)* = a: At. 


If A is a correspondence of R on © and B a correspondence of 
© on F, then since 
(BA)* = A*B* (3.5) 


BA maps ®t linearly on %, and A*B* maps the dual space T 
of Ẹ on the dual P of R. 

We have agreed once and for all to consider the set 
Xi, fy ‘°°; %, of components of a vector g as a column; the 
inner product of the vector g in R with the vector £ in P can 
therefore be written in matrix notation as é*x or x*& The 
transformations (3,3), from the first of which it follows that 
x* = x’*A* are consequently contragredient to one another if 


A*A=1 or A= (A*)-}, (3.6) 


and we have arrived at an explicit expression for the contra- 
gredient transformation. 

Let R' be an n-dimensional sub-space of R =R, All 
vectors of P which are in involution with the totality of vectors 
of ft obviously constitute, in consequence of the simplest 
theorems on linear homogeneous equations, an (n — n’)-dimen- 
sional sub-space P’ of P. And from this we are led immediately 
to the result that if a correspondence A of R on tiself leaves the 
sub-space R invariant, then the dual correspondence A* of P on 
itself leaves the associated sub-space P' invariant. 

Let R be decomposed into two or more sub-spaces 
Rı + R.+: ++ of dimensionalities n, na °° -, and let the 
sub-space of P which consists of all vectors in involution with 
all vectors of Ra + ig +: * - be denoted by Py, the dimension- 
ality of which is also ,. Defining P,, P, analogously, we arrive 
at the decomposition P = P, + Pa +- for the sum of a 
vector of P, a vector of P,, etc., can only be. zero when each 
of the individual summands vanishes. In order to prove this 
latter statement, we note that if the sum is 0 then the first 
summand belongs to P, as well as to P, + P3+°° ', ie. it is 
in involution with all the vectors of R, + R +: +: as well as 
with all those of R, and is therefore in involution with all the 
vectors of R. But this is only possible if this first, and therefore 
any, summand is zero. P, can be considered as the space dual 
to R,, for if y is an arbitrary vector in R, and 7 a vector in P 
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with components 7 in the various P,, then the product of 
g and 7 is equal to the product of g and 7”). 

If a correspondence A of K on itself leaves the n’-dimensional 
sub-space R’ invariant, then the (n — n’)-dimensional sub-space 
P’ is invariant under the dual correspondence A* of P on itself. 
If R is decomposed into Rı + Ra +° ++ and if A leaves each 
of the sub-spaces R, invariant, then A* leaves each of the sub- 
spaces P, invariant. If A is any correspondence in R and [A],, 
that portion in which Re intersects Rg, then the portion [A*]g, 
of A* in which P, intersects P, is dual to [A],,: 


LA") pa = [A] "ap. (3.7) 


[A]lag Maps Rg on Ji, and [A*],, maps the dual space P, on Py. 
All these results are conceptually evident, but can be seen 

even more readily directly from the matrices on adapting the 

co-ordinate system to the decomposition R, + R,+-::. 


§4, Unitary Geometry and Hermitian Forms 


The metric is introduced into affine geometry by means of 
a new fundamental concept: the absolute magnitude of a vector. 
In Euclidean geometry the sum of the squares 


Ge aia xP te + x,” (4.1) 


of the components of a vector g = (X1, Xa °° *, %,) is taken as 
the square of its absolute value. The only co-ordinate systems 
which are then equally permissible are the Cartesian systems, 
in which the square of the absolute value of ¢ is given by (4.1) 
in terms of the components x,; the range of values which the 
components may here assume is taken as the continuum of all — 
real numbers. But the content of the preceding paragraphs 
is not bound to this choice; the only requirement is, in fact, 
that the range of permissible values constitute a ‘‘ field ” in 
which the four fundamental operations (excluding division by 
zero) can be performed. We shall hereafter consider the con- 
tinuum of all complex numbers as the range of values which our 
components may assume. The expression (4.1) loses its definite 
character in this domain; the sum of the squares can vanish 
without implying that each term is zero. It is therefore desirable 
to replace the quadratic form (4.1) by the “unit Hermitian 
form ” 


yy + Feta tt + EnEn (4.2) 


where & denotes the complex conjugate of a number x. The 
value g? of (4.2) will be taken as the square of the absolute 


SSS) Woe a 
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magnitude of the vector g = (x3, %o,'**, £n) and the correspond- 
ing bilinear form 


(ch) = Mit Ya te H enn 


as the scalar product (rn) of the two vectors r and ) = 
(Yu Ya ***, Yn) A co-ordinate system is said to be normal 
when the square of the absolute magnitude of a vector g is 
expressed in terms of its components x; in this co-ordinate 
system by (4.2), In a normal co-ordinate system e, these 
components are the scalar products 


Xi = (e,2). (4.3) 
The transformations which lead from one normal co-ordinate 
system to another such, which therefore leave the form (4.2) 
invariant, are called unitary transformations .,Ț 
The conditions which characterize unitary transformations 
are entirely analogous to those for orthogonal transformations, 
with which we are familiar from the elements of analytic geo- 
metry. Let x= Sx’ be such a transformation; under the 
influence of S the fundamental metric form (4.2) goes over into 
#'*S*Sx’. S is therefore unitary if and only if S*S=1; the 
fact that det S +0 follows immediately from this. Indeed, 
since a matrix S and its transposed S* have the same deter- 
minant, it follows that the determinant of a unitary transformation 
has the absolute value 1:|det S|} 1. These conditions may 


be expressed by the assertion that S* is the matrix S-! reciprocal 


to S, and therefore not only S*S = 1 but also SS* = 1. The 
first of these equations states that the sum of the squares of 
the absolute values of the elements of a column is 1 and that 
the sum of the mixed products ZS riSrh of two different columns 


(i +k) is 0; the second equation contains the same assertion 
for the elements of the rows. 

We carry over the terminology usual in Euclidean geometry. 
In particular, the vector ) is said to be perpendicular to ç if 
the scalar product (ry) vanishes. In virtue of the symmetry law 


(bx) = (tb) 
perpendicularity is a reciprocal relationship. There exists no 
vector a, except a = 0, to which all vectors are perpendicular ; 
in fact, a = 0 is the only vector which is perpendicular to itself. 
Normal co-ordinate systems can be characterized by the fact 
t The name “orthogonal” has been used in the physical literature to 


denote these transformations, but in mathematics it is necessary to have 
different names for these two different concepts. 
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that for them the scalar products of the fundamental vectors 
e, among themselves are 


> . flG@=k) 

(e; ex) a Six — {0 (i 4- k). 
_ On comparing the fundamental metric form (4.2) with (3.2) 
it is seen that the unitary space R can be characterized by the 


fact that its conjugate complex R coincides with its dual P, or 


more precisely, that the conjugate complex rt of a vector g can 
at the same time be considered as its dual. We found that with 
a correspondence A of an m-dimensional unitary space R on 
an n-dimensional © is associated in an invariant manner the 
correspondence A* of the dual space Æ on the dual P. As a 


consequence of the equation P = R for unitary spaces 
At*=A 
is a correspondence of © on R; we call it the ‘ Hermitian 


conjugate of A.” AA is a correspondence of R on itself, 


AA of © on itself. A correspondence © which carries the 
general vector g over into r= St is unitary if it leaves the 
absolute magnitude of y unaltered: g’? = y’. Two configura- 
tions consisting of vectors, either of which can be obtained from 
the other by a unitary transformation, are congruent in unitary 
geometry ; i.e. unitary geometry is the theory of those relation- 
ships which are invariant under an arbitrary unitary transforma- 
tion, The characteristic property of such transformations 1s 
expressed in terms of the matrix calculus by either of the two 
equations 


§s=1, SS=1. 


Let 9’ be an m-dimensional linear sub-space spanned by 
the linearly independent vectors Q), Mg, * * *, Am: We consider 
a vector ¢ as belonging to the sub-space R” if and only if it is 
perpendicular to K’, i.e. to all the vectors of R: such a vector 
must therefore satisfy the equations 


(a, 2) = 0, (aa X) = 0, sf, (Am £) = 0. 


From these it follows that R” is (n — m)-dimensional. The 
relation between R’ and R” is a reciprocal one: every vector 
of R” is perpendicular to every vector of R and conversely. 
We then have R= R +R", for if the sum yz’ +g” of a vector 
yr’ in R and a vector p” in R” vanishes then y = — g” is a 
vector which belongs to both sub-spaces and is consequently 
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perpendicular to itself, and this can only occur if p =0. A 
` unitary correspondence which leaves R’ invariant will also leave 
R” invariant since the relation of perpendicularity will not be 
destroyed by such a transformation. In dealing wiih unitary 
correspondences or transformations it is therefore always possible 
to find an invariant sub-space R” associated with a given invariant 
sub-space R, such that R= R 4R”. The previous remarks 
about projection suggest that here in the unitary geometry we 
identify the space generated by projecting R with respect to 
i’ with the sub-space R”: we project on to the space R” per- 
pendicular to R’. To this end we remark that among all vectors 
ain R which are congruent mod. Jt’ there is one (a) which lies 
in R”; we then have 


(a-a) = ala), (a -+ b) = (a) + (b). 


With an arbitrary linear correspondence A 


4) —> fy’ = Ay ` Vi = P (4.4) 
of R on itself is, as we have seen, associated a bilinear form 
Š lix Ei Yx 


which depends linearly on a vector £ in P and a vector ġ in 9. 
In unitary space we can therefore associate the form 


A(t, 9) = Z lir Ti Vey 


depending linearly on Yy = (y,) and g = (ž,), with the correspond- 
ence (4.4). It is in fact the scalar product of g and Ay. The 
special case in which 


A=A or A(y,t)= AE, Y) or ari = ay (4.5) 


bears the name of the French mathematician Hermite. The 
correspondence (4.4) is consequently Hermitian if the scalar 
product of y with Ay is the conjugate complex of the scalar 
product of ġ with Ag. On identifying ) with g we obtain the 
“ Hermitian form” 


A(t) = A(t, 1) = Lay Ži Xr (4.6) 


i.e, the scalar product of g and Az; in consequence of (4.5) its 
value is real. An Hermitian form or correspondence A is said 
to be non-degenerate if there exists no vector y, except x = 0, 
whose transform Az vanishes It is positive definite if the value 
of the form A(r) > 0 for all vectors g +0; a positive definite 
form is non-degenerate. 

The fundamental metric form (4.2) is one such positive 
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definite Hermitian form, the “ unit form,” the coefficients of 
which consist of the numbers 

5. — f (t = k) 

(0 (i = Ry’ 


On introducing an arbitrary co-ordinate system a, (i = 1, 2, +++, ) 
into the n-dimensional space, the absolute magnitude of an 
arbitrary vector 

E = Xita F Xaaa + e o + Hy dy 
is given by 

L? = Qin Zikr, Bir = (0; 04). 

The expression for x? is accordingly always a definite Hermitian 
form; conversely, any positive definite Hermitian form G(z) 
could be taken as the fundamental metric form. To show this 
we employ the associated Hermitian bilinear form G(z, 9) to 
carry through the following procedure, which is patterned after 
the step-by-step construction of a Cartesian co-ordinate system. 
Choose any non-vanishing vector e,; since G(e,) > 0 we may, 
on multiplying e, by an appropriate numerical factor, normalize 
it in accordance with the equation G(e;) = 1. When the process 
of constructing a system of unitary-orthogonal vectors e, 


G(€z, €r) = Sip 


has been carried through m steps, 1= I, 2, °° -, m, the next 
step is accomplished by choosing a solution g = êm}, of the 
m <n homogeneous linear equations G(e;, x)= 0 for the n 
unknown components of the vector g + 0 and normalizing it 
in accordance with the equation G(@m.,) = 1. The procedure 
comes to an end after n steps; we then have n vectors 
ei, ĉa, © © *, ên Of such a kind that 


G, £) = ixi + ra t’ tt En Xn 


T = Wy ey + Xgl, + ° ° “+ Xn ên 


It follows from the equations themselves that g can only vanish 
when all of its components x; vanish, and consequently the e; 
are linearly independent and constitute a co-ordinate system 
in R. 

The transition from affine to metric geometry can accordingly 
be accomplished by the introduction of the axiom: 

(ò) The square of the absolute magnitude of a vector y is a real 
number x? which is a positive definite Hermitian form in the 
components of t. 

These last considerations are useful in another connection. 
If R’ is a linear sub-space of R we can employ the construction 


where 


20 | UNITARY GEOMETRY 


used above to find m vectors ¢,, Ca, * * ', êm in R’ which span R’ 
and are mutually unitary-orthogonal in the sense of the equations 
(€,¢,) = ix By continuing the construction we can supplement 
these m fundamental vectors by -— m additional ones 
Omit) ° °°, Ên SO that the two sets together form a co-ordinate 
system for the entire space R. We can therefore adapt our 
normal co-ordinate system to the separation of Rt’ out of R or 
to the decomposition of R = R + R” into two perpendicular 
sub-spaces. 

Since the correspondence A of R on to itself is invariantively 
connected with the Hermitian form A in R, we may speak of 
the product BA of two Hermitian forms A, B in R, but this 
product is not in general Hermitian as 


BA = AB = AB. 


The trace of an Hermitian form or correspondence A is real. 
The positive definite expression 


tr (AA) = Z| airl? (4.7) 


is of particular importance. When § is decomposed into 
mutually perpendicular sub-spaces Ne (a = 1, 2, » » +) the section 
Aag of the correspondence or form A in which Ra intersects Rg 
is uniquely determined ; it is a correspondence of Rt, on Ra, 


and Age, the Ba-section of A, is a correspondence of Ra on Re. 
When the co-ordinate system is adapted to the decomposition 
of R we have 


tr (Aug Age) = tr (Aga Aap) == y| tpl? (4.8) 


where in the sum 7 runs through the a, & through the B® set 
of indices. 

Any non-vanishing vector a determines a ray a which consists 
of all vectors of the form Aa, A being an arbitrary complex number. 
The generating vector a can be so normalized that its absolute 
value |a| = 1; this does not, however, determine a to within 
a change of sign, as in the real domain, as the normalization is 
unaltered on multiplying a by an arbitrary (complex) number ¢ 
of modulus 1. We shall call the totality of vectors of R the 
vector field and the totality of rays the ray field R. Any 
non-degenerate linear correspondence A of the vector field R 
on itself is at the same time a correspondence of the ray field 
R on itself, but this latter correspondence is unaltered by 
multiplication with any non-vanishing number. A unitary 
correspondence or transformation of the ray field on itself will 
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be briefly referred to as a rotation. By the symbol S’ ~ S we 
shall mean that the two transformations S, S’ of the vector 
field on itself differ only by a numerical factor ¢ of modulus 1: 

‘= 2S, whence they both give rise to the same rotation of 
the ray field. | 


§5. Transformation to Principal Axes 


The fundamental theorem on Hermitian forms is that con- 
cerning the transformation to principal axes. We are here 
concerned with the analogue of the familiar problem of finding 
the principal axes of an ellipse or ellipsoid in the ordinary 
geometry of two or three dimensions. We wish to find a normal 
co-ordinate system e; associated with a Hermitian form A(z) such 
that in addition to 


E = Xlr H Xlr T ~ e E LnÊn 
E? = xa + Ža H to H Enta (5.1) 
we also have 
A) = TX, + Raaka F tT Annn ; (5.2) 


that is, A shall be brought into the normal form (5.2) by means 
of a unitary transformation. The real numbers a, %, °° *, On 
are called the characteristic numbers of the form A, and 
ei êz, ° °°, ên the corresponding characteristic vectors. 

To this end we first consider the correspondence x —> y = Ag 
and seek those vectors y +0 which are transformed into 
multiples x’ = Ar of themselves by A. We then obtain the 
‘ secular equation ” 


FA) = det (AL — A) = 0 


for the multipliers A. According to the fundamental theorem of 
algebra this equation certainly has a root A= a,; corresponding 
to it a non-vanishing vector g = è, can be found which satisfies 
the equation Ae, = a,¢@;, and on multiplying this vector by an 
appropriate numerical factor we may take it such that its modulus 
is unity. ¢, can then be supplemented by  — 1 further vectors 
êa, °**, Èn in such a way that these n vectors constitute a normal 
co-ordinate system. In these co-ordinates the formule 


e; = Ae; = Anke 
for the correspondence A require, in accordance with the 


definition of e,, that the coefficients ag, ag; * * *, @,; vanish and 
that a,, == 0, Because of the symmetry conditions a,; = dy, 
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30, Ays, °°‘, 41, Must also vanish. Hence in the new co-ordinates 
the matrix A assumes the form 


O sp 3g ° * * Asn || (5.3) 


a è è> ò aè Ò 5 G k Ò P 


0 Anz Ang °° * Anal 
and the Hermitian form becomes 
A(t) = a8 2, + AY) (5.3) 


where A’ is an Hermitian form containing only the n—1 variables 
Xa ža ‘°°, n Repeating this process, or calling on the method 
of mathematical induction, we establish the validity of the 
fundamental theorem stated above. 

The characteristic polynomial of (5.2) is 


det (AL — A) = (À — ay)(A— aa) © * + (A— an). 


From this it follows that the characteristic numbers æ, 
Xo °° *, Om, including their multiplicity, are uniquely deter- 
mined by the Hermitian form A; their sum is the trace of A. 
What can we say concerning the characteristic vectors? Let 
« be a given real number; the vectors which satisfy the equa- 
tion Ar = ag constitute a linear sub-space R(«) of R, the 
characteristic space belonging to « When the normal 
co-ordinate system ¢; is so chosen that A is in the normal form, 
the equation Ag = of is, in terms of its components, 


OX; == AX; 


from which it follows that R(«) is spanned by those vectors e; 
for which a; = «. If, for example, the three roots 0, @g, %3 = a 
while all the others are different from «, the characteristic space 
R(x) is 3-dimensional. If none of the characteristic numbers 
a, is equal to a, R(«) consists only of the vector 0. This again 
characterizes the characteristic numbers, including their multi- 
plicity, in a way which is independent of the particular co- 
ordinate system chosen, and in addition it characterizes the 
corresponding sub-spaces R(«). R is thus decomposed into the 
characteristic spaces R(a): R= DR(a); only a finite number 


a 
of terms occurs in this sum, i.e. those for which « is a character- 
istic number of A. A complete co-ordinate system @,, @:,° °°, ên 
for the entire space can be obtained by choosing a normal 
co-ordinate system in each non-null sub-space R(«). The 
normal form (5.2) is undisturbed on subjecting the variables 
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Xx, associated with the same characteristic number a, = « to an 
arbitrary unitary transformation. 


If, for example, « is a triple characteristic number 
Oy = Oy = Oy = a 


while the remaining «; «, then xe, + fast Xas is the 
normal projection ga of the vector x on R(«) and 


E(t) = dye, + Fore + Žats 


is the scalar product of ga with itself. The equations (5.1), 
(5.2) may then be written in the invariant form 


= ZEE), A(t) TE are i E,{t). i (5.4) 


R being a sub-space of R, any vector p can be uniquely 
broken up into x’ + to where r’ lies in W and to is perpendicular 
to R. The “orthogonal projection ” g —> y = E’y is a linear 
correspondence which obviously has the property 


E'E' = E’ (5.5) 


for the projection of r’ on R' is simply x’ itself. Furthermore, 
the operator E” is Hermitian, for the scalar product of 4 into x’ 
is equal to the scalar product of y’ into x’, where y’ is the projection 
of pon R’. (The Hermitian form E’(z) is accordingly the square 
of the absolute value of x’.) We shall call Hermitian forms 
which satisfy equation (5.5) idempotent. 

When the sub-spaces ft’, R” are orthogonal, the two corre- 
sponding projection operators EH’, E” satisfy the equations 


E'E" =0, EE’ =0, (5.6) 


for E’ (E’’r) is the component of E”g lying in the space R’ per- 
pendicular to E’y. Idempotent operators which satisfy these 
equations are said to be independent. The second equation is, 
moreover, a consequence of the first, as may be seen on going 


over to the Hermitian conjugate : E"E’ =0. If R is decom- 


posed into several mutually orthogonal sub-spaces HR’ + ee 
then 


c= e+ bat: :. (5.7) 


It is easily shown that the converses of all these assertions 
are also valid. If E’ is an idempotent operator and E” = 1 — F’, 
all vectors of the form E’r constitute a linear sub-space R’ and 
all vectors of the form E’’¢ a sub-space R”. The equation 


E'R"” = E'E" aan E'(1 NE E') — 0 
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shows that the scalar product of a vector E’y in R and a vector 
Ey in R” is zero: ZĒ'E"y=0. The decomposition of a 
vector ¢ into a component lying in fi’ and one perpendicular 
to R’ is accordingly expressed by 


r= Eg + i— E'it. 


If the two idempotent forms F’, E” satisfy the equation (5.6) 
then, as we have just seen, the two corresponding characteristic 
spaces R, R” are mutually perpendicular. If the sum (5.7) 
consists of independent idempotent forms, then by the above 
the corresponding mutually perpendicular sub-spaces W, R” 
exhaust the entire space Ñ. 

The theorem on transformation to principal axes can accord- 
ingly be stated: An Hermitian form A associates with the real 
numbers « mutually independent idempotent Hermitian forms Ea 
such that 


1= SE, A=Ja-&,; (5.8) 


Ea is non-vanishing for only a finite number of values a. 
A correspondence A can be reiterated : 


AA = A, ÆA=A 
and we can accordingly obtain polynomials 
F(A) = Col + yA + cA +++ + + cp A 


in 4 with numerical coefficients c. On reiterating (5.8) h—1 
times 
Ah = SohE, 
a 


whence for the general polynomial f 
fA) = Bfla)Eo. (5.9) 


The characteristic numbers of f(A) are therefore the values of 
the polynomial f(a) for the characteristic numbers « of A. This 
Suggests defining the Hermitian form f(A), where f(a) is any 
es function of the real variable a, by means of the equation 

Given two Hermitian forms A, B, under what conditions can 
they be brought simultaneously into diagonal form, i.e. when is 
it possible to find a normal co-ordinate system in which 


Alr) = Aity + Aaaa Heei + Enfn%n (5 10) 
Bit) can BE, T Peña% a eee Baintn? i 
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A necessary condition is that they commute: BA =: AB, for if 
A and B are in the normal form (5.10) BA as well as AB 1s 
the diagonal matrix with elements B,a,; = «8, This condition 
is also sufficient; to prove this, choose a normal co-ordinate 
system in which A is already in normal form. The equation 
BA = AB requires that the matrix B = ||b,;|| satisfy 


bix Ay == Oy bin or (a, = 04) ix = (5.11) 


We divide the indices z, the fundamental vectors e; and the 
variables x; into classes by considering 7 and k to be of the same 
class if «; = a, Equation (5.11) states that by,=0 when 
i and k belong to different classes. B is consequently decom- 
posed into smaller matrices B’, B” aligned along the principal 
diagonal, corresponding to the way in which the a; are distri- 
buted in classes «’, «’’, +--+; the correspondence B consequently 
leaves each of the characteristic spaces F(a’), R(x), ++- of A 
invariant. But we can then choose a normal co-ordinate 
system in each of these characteristic sub-spaces R(«æ) in such 
a way that the Hermitian correspondences B’, B” in them are 
referred to principal axes; the normal form of A is undisturbed 
by this procedure. 

This process can immediately be applied to any number of 
Hermitian forms: Any number of Hermitian forms can be brought 
simultaneously into normal form if and only if they commute 
with one another. By a slight modification we can further. 
extend this theorem to an arbitrary finite or infinite system X of 
Hermitian forms. This will be briefly discussed here, although 
in general the consideration of systems of forms or correspond- 
ence is postponed until Chap. III. Let the space R be decom- 
posed into mutually perpendicular sub-spaces R’, R”, -+ + in 
such a way that each correspondence of the system 2 takes 
place in these sub-spaces; on adapting the co-ordinate system 
to this decomposition each Hermitian matrix A of 2 consists 
of sub-matrices A’, A”, < + aligned along the principal diagonal. 
If all the A’ are already multiples of the unit matrix 1 in R 
and similarly for all A”, ++ +, our goal is reached, for each corre- 
spondence A of the system then transforms R’ into itself and 
is a simple multiplication in it; similarly for R’,---. But if 
this is not the case let A be a correspondence of the system 
which is not merely a multiplication in the sub-space ft’. On 
transforming the constituent A’ of A to principal axes, R’ is 
decomposed into characteristic spaces Rt,’ + Ry +°- of A’, of 
which there are at least two. For any Hermitian matrix X 
of 2 we have A'X' = X'A’, from which it follows, as we saw 
above, that X’. transforms each of the sub-spaces R’, Re’, °° 
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into itself. The decomposition R + R” +--+ can thus be 
further reduced to the decomposition (R +R,’ +° °°) + 
R” +--+ + Proceeding in this way we finally reach our goal 
after at most steps, proving : 

The Hermitian forms of any system X can be simultaneously 
referred to principal axes if they all commute with one another. 

The theory developed above for Hermitian correspondence is 
valid as tt stands for unitary transformations. S being any unitary 
operator, a normal co-ordinate system e; can be introduced in such 
a way that S carries each of the fundamental vectors e; over into 
a multiple o,; of itself. The characteristic numbers o; of S are 
numbers of modulus 1. In these co-ordinates the matrix of S 
is a diagonal matrix, the elements in the principal diagonal 
of which are the numbers o;. 

The proof is quite analogous. We again start with the 
secular equation 

: det (e1 — $S) = 0 


and consider the root o There then exists a vector ¢, of modulus 
1 which is transformed into o,e, by the correspondence S. Sup- 
plement e, with n — 1 further vectors e, ° + -, ¢, so that these n 
vectors form a normal co-ordinate system. In these co-ordinates 
the matrix ||s,,|| of the correspondence S: 


Se; = Z Ski Ox 
is again of the form 
Su = Oy, Sa =’ = Sm = O. 
Since S is unitary the sum of the squares of the moduli of these 


elements of the first column must be unity, whence |o,| = 1. 


Similarly the sum of the squares of the moduli of the elements 
in the first row must also be 1: 


lo, |? + [s12]? tet [Sin]? =l; 
but since lo]? = ] it follows that 
S =*= Sn = 0. 


The matrix S is now broken up into a l-dimensional o, and 
an (n — 1)-dimensional S’ as in (5.3); the truth of the above 
theorem then follows immediately by induction. 

The further results can be obtained in exactly the same way 
as above for Hermitian forms. The characteristic numbers c, 
including their multiplicity but not their order, are uniquely 
determined by S, and similarly for the corresponding sub-spaces. 
If we wish to find a linearly independent system of character- 
istic vectors, the fundamental vectors of each such sub-space 
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may be taken as forming a normal co-ordinate system. Finally, 
a finite or infinite set of unitary transformations can be simul- 
taneously reduced to normal form if and only if they commute 
among themselves. 


§ 6. Infinitesimal Unitary Transformations 


A rigid body in continuous motion about a fixed point O 
performs an infinitesimal rotation in each interval dr of time. 
Denoting by (dx,, d%_, dx) the infinitesimal displacement of 
that point of the rigid body which is at the point P(x, £a %3) 
at the time r, the equations of motion of the body must be of 
the form 


. dx; 
Xi = ar = Aci Xx (6.1) 


in which the coefficients c;, are constants, 1.e. independent 
of the particular point P under consideration. Employing a 
Cartesian co-ordinate system with O as origin, 7,7 + x? + x3? 
must remain unchanged throughout the motion; this requires 
that 


dx; 
Xx =0 or Ley Kx, = Q. 
i dT ik 


Since this equation must be satisfied identically in the x, the 
matrix C = ||¢,,|| which characterizes the motion must be anti- 
symmetric: Cpi == — Cip Introducing the vector t with origin 
at O and terminus at the point P, and the vector c= (C23, Ca, C12), 
equations (6.1) become 

ay 

dr == [te], 
the familiar fundamental formule for the kinematics of a rigid 
body. The square brackets denote the vector product and ¢ 
the vectorial angular velocity, the absolute value and direction 
of which give the angular velocity and direction of the axis of 
rotation respectively. 

The continuous compounding of interest offers another 
example of an infinitesimal linear transformation. The interest 
rate being c, a real number, the increase in the capital x in time 
dr is xcdr. Radioactive disintegration is the same kind of a 
process with negative c. The capital x, considered as a function 
of the time, satisfies the equation 


L = 6.2 
dr cx (6.2) 
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and consequently increases exponentially with 7. If the prin- 
cipal has the value x) at time + = 0, it will have increased to 


X(T) = xg * e“ 


at time 7. To obtain an alternative solution we divide, as in 
the method of finite differences, the time interval 7 into a large 
number n of equal elements r/n; x will increase by xcr/n in 
each of these intervals and the capital x will accordingly be 
multiplied by (1 + cr/n)" at the end of time 7. The familiar 
definition 
. CTN” 
et = lim (1 + =) (6.3) 
n-> © n 

of the exponential function follows from a comparison of these 
two results. But we can also solve the differential equation 
(6.2) by the method of successive approximations. We take as 
the 0" approximation the initial value xọ: x(r) = xọ The 
(n + 1)st approximation is obtained from the n*® by substituting 


the latter in place of x on the right-hand side of (6.2) and 
integrating : 


Xnay(T) = X% + c{x_(é)dt. 
0 


On carrying out this process we find 


%,(7) = wo] + ii free orl, 


n | 
from which we obtain the familiar power series expansion 


2 
ent En. (6.4) 


for the exponential function. The convergence of (6.3) and 
(6.4) and the identity of their limits is rigorously proved by 
elementary analysis. | 

These examples will assist in understanding the concept of an 
infinitesimal unitary transformation of the n-dimensional 
space R= Ra, which we now proceed to introduce. In order 
to avoid the use of infinitesimals we introduce a (purely fictitious) 
time r and think of the infinitesimal linear correspondence which 


carries the vector ț over into ¢ + dz as taking place in the time 
interval dr: 
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(For the sake of brevity we refer to this simply as “ the in- 

finitesimal transformation C.”) Since the transformation is 

unitary, on employing a normal co-ordinate system $; x; must 
i 


remain unchanged : 
fa + Sx,.5- = 0. 6.5 
Z t dr + ai dr (6.5) 
On setting 
az - . 
T = Š Cik ey = = A Cri fi 
% 
the left-hand side of (6.5) reduces to the Hermitian form 
Z (ir F Cys) B; Xy 
t, 
and since it must vanish identically in the x; we must have 


Cix + ĉr = 0, or the transformation C is anti-symmetric in 
the sense of the equation 


m 


Cin = — Crsy =C. (6.6) 
In the real domain there exists no intimate relationship between 
symmetric and anti-symmetric matrices, but the situation is 
different in the complex domain. For on setting C = iH (i being 
the imaginary unit v — 1) it follows from (6.6) that H satisfies 


the equation H = H, and C is consequently 7 times an Hermitian 
matrix. In an infinitesimal unitary rotation of a vector field the 
. dr. 
velocity Fe is related to x by means of a correspondence whose matrix 
is i times an Hermitian matrix. The theorem on transformation 
of Hermitian forms to principal axes is accordingly the limiting 
case of an analogous theorem on unitary transformations. 
By repeated application of the infinitesimal unitary trans- 
formation 
dg = dr’ Ct (6.7) 
we obtain after time r 
£ -> g(r) = U(r) = eg (6.8) 


where the exponential function e4 for a matrix A can be defined 
by either 
lim (1 + y 
im — 
h—> © h 


A, A 
Pry tate 


or the power series 


Naturally 
U(r + 7’) = U(r) U(r’). 
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Accordingly U(r) runs through all the transformations of a 
1-parameter continuous group of unitary transformations gener- 
ated by the infinitesimal transformation C ; the parameter 7r is 
additive on composition. The power series is obtained by the 
method of successive approximations; this method can also 
be applied to obtain a solution in the more general case in which 
the infinitesimal unitary transformation C is not the same for 
each time element dr, i.e. in which C is a matrix C (T) depending 
on the time 7. The solution of the equation 


ut = C(r)g 
for this case is given by 

Elta) = U(rers)e (rx) ; 
the unitary transformation U(z.7,) which takes place in the 
time interval 7,, 7, obeys the law of composition 


U (rr 1) = U (T3T2) U (Ter). 


If £ = fo at time 7 = 0, the formule for the successive approx- 
imations % (7) are 


tolt) = Eo; Eral) = to + [CWer(i)ae ; 


for U(t) = U(r 0) we obtain the infinite series 5 U, (r) in which | 
Le 0 


Uolt)=1; Uml) = [COU dt. (6.9) 
Written explicitly, ° 


U(r) = ffe JC(t)C(te) -+ + Chdi dtp © © + dh, 
(OS4 545 °°' St) <7) 


The proof of the convergence of this process is readily ob- 
tained with the aid of the quantity | 4 | associated with a matrix 
A = || 4, || by the equation 


|4 P= tr (44) = Zl an |è 


It follows from the well-known Schwarz inequality 


| arb, + aab + +--+ a,b,|? 
<S (|a l? -+ ..- + | an |2)(|b, [2 + s. -+ | bal?) (6.10) 


|4+B| S/Al/4+ IBI 
|AB| S |4| | Bl. 


that 


and that 


REMARKS ON oo-DIMENSIONAL SPACE 31 


The second inequality is obtained by applying (6.10) to the 
element 
Cin = Sir Onn 
of C= AB: l 
| cirl? S D| airl D | brl 
r r 


and summing with respect to t and k. The first inequality may 

be stated in the form 

|40) de | S [| Ale | dt. 
0 


0 
for integrals. The convergence of XU, (7) can now be established 
l 


with the aid of these auxiliary results, for we can prove that 
under the assumption 


|C(t)| Se (0 StS?) 
that 


For this is certainly true for l = 0, and the recursion formula 
(6.9) enables us to conclude that it holds for U;,, if it holds for 
U, The convergence follows from this absolute convergence, 
for the absolute value of each component of the matrix A is 
certainly not greater than | A |. 

We have only gone into these matters to reassure the reader 
of the legitimacy of dealing with infinitesimal quantities of the 
kind met here. The only thing of importance for the following 
is the simple relation existing between infinitesimal unitary 
transformations and Hermitian forms. 


§ 7. Remarks on o-dimensional Space 


The unitary spaces which appear in quantum mechanics 
usually have an infinite number of dimensions. Such a space 
consists of all vectors 

= (%4, Xa," e *) 


whose components x, constitute an infinite sequence of numbers 
for which 


T? = Ky + HX +: 
converges. Within this domain addition and multiplication 


with numbers, as well as the construction of the scalar product 
of two vectors, are possible. All the axioms employed so far 
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are satisfied, with the exception of the dimensionality axiom y 
introduced in § 1. 

Since the vector components %,, %, °° * constitute a de- 
numerable set, this “ Hilbert space ’’ has a denumerably infinite 
number of dimensions. But in addition to these, spaces of 
non-denumerably infinite dimensions may occur. Consider, for 
example, all continuous complex functions #(s) of a real variable 
s of period 27. We need not distinguish between two values of s 
which are congruent mod 2z, i.e. whose difference is an integral 
multiple of 27 ; it is consequently more convenient to consider f(s) 
as a function defined on the periphery of the unit circle than on the 
straight line. The various values of s at points on the circum- 
ference play the rôle of indices, the value ¥(s) at the point s being 
the component of the “vector %” with index s. The totality 
of such functions (s) therefore constitute a linear “ function 
space ” of continuously infinite dimensions. Addition of these 
vectors and multiplication by a number have here the same 
interpretation as in the ordinary operations with functions. 
The square of the absolute value of the vector # is taken to be 


i, p) = [Elod 


and the scalar product of two vectors ¢ and yf as 


$, Y) = [Eds 
A set of functions l 


$4(s), a(s), se Pns) 


constitutes a, unitary-orthogonal system of vectors if 
27 
{ Fi(s)x(s)ds = Sa 
0 


These vectors span an n-dimensional sub-space R, of the oo-di- 
mensional function space, i.e. that sub-space consisting of all ‘ 
vectors of the form 


P(s) = %yha(s) + xapals) ++ + + + Xnbn(s). 


Xi, Xa ° °°, %, are the components in the co-ordinate system 
1, Pa © * ', x Of the vector ¢(s) in Ra. We have 


($, $) = [F(s)$(s)ds = yey F ža te + Kaka. 
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An arbitrary vector ẹ can be broken up into a component $ 
which lies in R, and a component #’ perpendicular to Ray : 


p=, i 
$9) = Bx. dls), SEs = 0. 


It follows from these equations that [cf. (4.3)] 


r = | Hds. 


These integrals are called the Fourier coeficients of the function 
ù$ with respect to the orthogonal system ¢;, The orthogonal 
projection ¢ on R, cannot be longer (i.e. have greater absolute 
magnitude) than ẹ itself; this is the content of the so-called 
Bessel inequality 


an 
Zaty + atin ++ + ntn S J P(s)p(s) ds. on 
0 
In fact, since ($, Y) = 0, (W, $) = 0, the “ Pythagorean theorem” 
(Y, p) = ($, 4) + g) 
holds. 
The simplest unitary-orthogonal system in the domain of 


periodic functions, with which the theory of Fourier series is 
concerned, consists of the functions 


e(ns) In=0,+1,42,°°°; efx) = er] (T.2) 


l 
V Qa 
This infinite system has the property of completeness; it 
is a complete co-ordinate system for the entire function space. 


The theorem that any periodic function #(s) can be expressed 
as a linear combination of the functions (7.2) : 


-- co 27 
Hs) = Fem Si nels), a= a= | Has)¥s)as 


(Fourier expansion of #(s)) is true only if certain conditions 
concerning the differentiability of ¥(s) are fulfilled, but any 
continuous function satisfies Parseval’s equation 


27 ob 
f Bls\(s)ds = X Hatt (7.3) 
. | n= =O 
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We learn from this example that there is no essential distinction, 
between spaces of a denumerable and of a non- denumerable infinitud e 
of dimensions ; we have introduced into our function space 
a complete normal co-ordinate system (7.2) consisting of a 
denumerably infinite set of fundamental vectors. In an m- 
dimensional unitary space a system of unitary-orthogona] 
vectors is complete if their number is n, but not if it is less - 
however, such an enumeration gives no criterion for oo-dimen- 
sional space. If we leave out a finite number of the functions 
(7.2) we still have an infinite set left, but the completeness of the 
system is destroyed thereby. The real criterion for complete- 
ness lies in the validity of the completeness relation (7.3). 

We can understand the relations existing in Hilbert space 
by analogy with or as limiting cases of those existing in spaces 
of a finite number of dimensions. If we consider the values of 
an arbitrary periodic function (s) only at the points 


and set 


r 
we are dealing with an n-dimensional vector space in which the 
components of the arbitrary vector % are these quantities 


é (v=0,1,:° +, —1). Lete, be the vector in this space 
with components 


Jae (=) [V=0,1-¢++,#—1); 
these vectors e, (A= 0, 1, ° - +, n — 1) constitute a normal co- 


ordinate system for the space, relative to which the vector Æ 
has the components £e x1, ° * *, Xa- which are to be calculated 


from 
nl 
1 Qardv 
£= > e( ve 
Aw) 


In accordance with (4.3) 


whence 
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By passing to the limit n + oo we obtain the equation of Parseval. 
We do not concern ourselves here with the further considerations 
which may be necessary to establish a rigorous proof, but content 
ourselves with such reasoning by analogy. 

We consider the linear correspondence or “ operator ” 


ld . . 
D = TI which transforms a function Ņ(s) in the domain of 


ar , . ld . a 
periodic functions into 7 a4 - e(ns) is the characteristic vector 


(characteristic function) of this operator belonging to the 
characteristic number x: 


l de(ns) 
zi T” e(ns). 


This operator is Hermitian; the scalar product of @ and Dy 
is the conjugate complex of that of y and Dd, where ¢ and 4 
are any two periodic functions, for by partial integration 


2n az 
l dp, _ la 
J 36) 4 i — [og ze 
0 0 
and the right-hand side is conjugate to 
Qn 
l dd 
|z PAP FA 
In fact, the Hermitian form 
2r 
lf 7 dh 
aa 
0 
assumes the normal form 
+ 00 
S NEnXn (7.4) 
nam = OD) 


in the normal co-ordinate system whose fundamental vectors 


are the characteristic vectors of the operator D. The reiterated 
2 


operator DD = — & appears in the theory of the vibrating 


string, together with the corresponding Hermitian form 


2x nm 
dry, _ (ap dh 
— |b eas = fs 7545 
0 0 
which represents the kinetic energy of the string. 
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We have here been dealing with a discrete spectrum of char- 
acteristic numbers. But in an oo-dimensional space Hermitian 
forms with a continuous spectrum can also be constructed. 
Consider, for example, the function space consisting of all con- 
tinuous functions 4(s) defined in the interval — rm Ss S4 r; 
the square of the absolute magnitude of the “‘ vector” a is then 


W, p) = FBlopbeas. 


The Hermitian form 
+ 
Aly] = {s¥f(s)y(s) ds (7.5) 


is already in normal form, which shows that it has as character- 
istic numbers all numbers between — 7 and + m. The functions 


(7.2) again constitute a complete normal co-ordinate system in 
terms of which 


+o 
Ws) ~ D, Fa (ns). 


Substituting this in (7.5) we find 
| Ei 
Alb] = SamnEmtny Onn = zie- ms)e(ns)ds. 
The evaluation of 
+r 
fs ‘e[(n — m)s]ds 


yields 0 when n = m and by partial integration 


[seid ag( 


i(n — m) i(n — m) 
when n +m. The Hermitian form 
1 = -m 
= a S 
1 n —- m 
n+ m 


has therefore as characteristic numbers all values between 
— x and +7. 

The characteristic vector %, belonging to the characteristic 
value a (— r < æ < + x) of Af] is that function which vanishes 
at all points s +æ and is there so large that the integral of 
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xpa has the value 1. Of course such a function does not really 
exist, but we can approximate it as closely as we wish. In 
order to arrive at a formulation which is mathematically rigorous 
for the case of continuous spectra, we must introduce in place 
of the idempotent Hermitian form E, in (5.4) the idempotent 
form AE = S E, for the entire interval A= A£ (a SA< 8). 


, aSA<B 
For any given vector £ 


AE(t)=0, ABE) + AEX) = AE) (1.6) 


and the idempotent forms AF associated with two separated 
intervals A are mutually independent. 

In dealing with the continuum, the sum in (5.4) is replaced 
by a Stieltjes integral. Consider the straight line described by 
the real variable A as being covered with a substance, and let 
the amount of this substance on the interval A be denoted by 
Am. We then have, in analogy to (7.6), 


Am = 0, Afm + Atm = Arm. 


If $(A) is a continuous function of position we can construct 
the integral 


foda. (7.7) 


An approximation to this integral can be found by dividing the 
entire interval 0 SA S1 into small intervals ae choosing a 
point A; in A; and evaluating the sum Zl i) © Am. This sum 


then converges to the integral on allowing the A; to approach 
zero. If the distribution has a continuous density 


Am 
lim, ax = 000 


1 
the integral is identical with J $e (AjdA, But the Stieltjes 


integral (7.7) also includes the cases in which there exists no 
finite continuous density ; in particular, it allows the existence 
of discrete points at which a finite amount of the substance is 
concentrated. If the substance is distributed over a finite 
number of points A == a; in amounts m, the Stieltjes integral 
reduces to the sum S'¢(a,)m,. 

t 


We thus arrive at the following more inclusive formulation 
of the fundamental theorem concerning the transformation to 
principal axes: (1) The Hermitian form A associates with each 
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interval A an idempotent form AE(t); (2) when two adjacent 
intervals A,, Ag are added together to form an interval A, 


AE=AE+A,E, 


and the idempotent forms associated with separated intervals are 
independent ; (3) we have 


+0 + 0 
y= dEQ), Alt) = JA Ely). 


In this form the theorem is adapted to the appearance of con- 
tinuous spectra of characteristic numbers, and is particularly 
appropriate for the purposes of quantum mechanics (cf. II, § 7). 
The discrete characteristic numbers lie at those points where 
the monotonic increasing function A4 „E(t) = E(À; x) of A has 
a discontinuity. In our example (7.5) 


B 
APETY] = JP(s)yl(s)ds ; 


here y must be taken as 0 outside the interval (— m, + 7). 


The evaluation in terms of the co-ordinates x, is readily accom- 
plished. 


Consider the function space consisting of the totality of 
all functions (s) of a variable s, which assumes all values from 
— œ to + œ, and which have a finite absolute magnitude 


+00 
(4, 4) = [P(s\p(s)ds, 


i.e. which are “ integrable square.” The characteristic functions 


associated with the linear correspondence yb(s) > ae are again 
the functions e(vs), but the frequency v can now assume all real 
values. The components of (s) are the quantities 


Fourter's integral theorem then allows us to conclude the validity 
of the expansion 


+0 


Hls) = feo 


-0 
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under certain assumptions concerning the differentiability of 
the function ¢(s); but in any case the completeness relation 1 


foley as = [fof 


is valid. We arrive at a somewhat different problem when y 
only require that the functions ẹ(s) be such that loy 
possess a definite mean value 


+a 


Jim al Bisyo(erds = (4, 4); 


this leads to the theory of almost-periodic functions developed by 
H. Bohr.» Here again the validity of the completeness relation 
can be established. 

The theory of the characteristic numbers of Hermitian forms 
in infinitely many variables has been developed by Hilbert and 
Hellinger, but it is applicable only to bounded forms 


A(t) = L liiir, 
i.e. forms whose values have a fixed upper bound when 
r = Paix, Sl. (7.8) 
i 


Indeed, without this assumption we cannot guarantee the 
convergence of A(r) in the entire domain (7.8) ; as an example 
consider the form (7.4), J’n#,x,. That this form only converges 


n 
in a portion of the domain (7.8) is merely another expression of 
the fact that not every continuous function is differentiable. 
The situation is more favourable for unitary forms as they 
satisfy the condition that they be ‘‘ bounded’ in consequence 
of their very definition; a unitary transformation is thereby 
to be taken as satisfying both of the conditions 


UU =1, UU=1. 


The theorem on principal axes has been proved rigorously for 
bounded Hermitian and for unitary correspondences in œ- 
dimensional space. A method due to A. Wintner* seems 
particularly appropriate for dealing with unitary correspond- 
ences; it is based on the consideration of the discrete group of 
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all powers U* of the given unitary transformation U, and deter- 
mines the monotonic increasing function E(A;zx) of the real 
variable A (0 SA < 2r) by means of the equations 


Um) = fed, EA; g) (7.9) 


(the problem of trigonometric moments). F. v. Neumann ® has 
gone furthest in dealing with linear operators for which bounded- 
ness is not postulated. In accordance with § 6 with a Hermitian 
form A is associated a group of unitary correspondences e'*4== U(r) 
depending on the real parameter r and satisfying the equation 


U(r + r) = U(r) U(r’) ; (7.10) 


the study of this group is equivalent to the study of A. It is 
therefore perhaps appropriate to replace this latter for o- 
dimensional space by the former, for no convergence difficulties 
appear in the domain of unitary transformations. We must 
therefore attempt to bring the operators U(r), which are con- 
tinuous functions of the real parameter 7 satisfying (7.10) 
simultaneously into the form 

an 

U(r; g) = Jed, EQ; t). (7.11) 

0 
This is accomplished with the aid of Wintner’s method on re- 
placing the discrete parameter n in (7.9) by the continuous 
parameter 7. The problem (7.11) bears the same relation to 
(7.9) as Fourier’s integral bears to Fourier series. 

In setting up a system of axioms for oo-dimensional vector 
space the axioms («), (8) of § 1 and the metric axiom (8) of § 4 
can be retained; for the proper substitute for the dimension 
axiom (y) see, e.g., v. Neumann, ‘‘ Mathematische Begründung 
der Quantenmechanik.” $ 

The algebraic and geometric tools developed in this chapter 
offer a natural medium for the expression of quantum mechanics ; 
they already hold a dominating position in the classical physics 
of continuous media. A masterly exposition of their mathe- 
matical content and application is found in the first part of 
Courant-Hilberi’s ‘‘ Methoden der mathematischen Physik,” 
2nd ed. (Berlin, 1930). 


+ 


CHAPTER II 
QUANTUM THEORY 


§ 1. Physical Foundations ! 


HE magic formula 
E = hv (1.1) 


from which the whole of quantum theory is developed, establishes 
a universal relationship between the frequency v of an oscillatory 
process and the energy E associated with such a process. The 
quantum of action A is one of the universal constants of nature 


h = 6-547 X 107?" erg secs, 


It was first discovered by Planck at the turn of the century in 
the laws of black body radiation ; that is, radiation which is 
enclosed in a cavity and is in thermodynamic equilibrium with 
matter of a definite temperature, which by emission and ab- 
sorption causes an exchange of energy between the various 
frequencies contained in the radiation. Since this equilibrium 
is independent of the particular nature of the matter involved, 
Planck considered, as a kind of schematic matter, a system of 
linear oscillators of all possible frequencies. A charge oscillating 
with frequency v interacts with the electromagnetic field by emitt- 
ing and absorbing radiation of the same frequency. Planck as- 
sumed that the exchange of energy took place in integral multiples 
of an energy quantum £; he at first considered this assumption 
merely as a mathematical device, and intended to pass to the 
limit e:= 0, In order to obtain agreement with the Wien 
displacement law, which was derived from general thermo- 
dynamical principles, the energy quantum associated with a 
definite frequency v must be taken proportional to v: e = hv. 
In this way Planck obtained his radiation formula, which is in 
excellent accord with observation ; according to it the amount 
of energy contained per unit volume in the spectral interval 
v, v + dv in thermodynamic equilibrium at temperature 6 is 


8rhvidv 
4l 


(1.2) 
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where c is the velocity of light and k the Boltzmann constant 
(2k@ being the mean energy of an atom of a monatomic gas at 
temperature 6), On passing to the limit h = 0 we obtain the 
Rayleigh-Feans radiation law 


8 2 
u(y) = al "kô. 
c 
The assumption of the validity of this latter law for the entire 
spectrum is in gross disagreement with the facts, as it would 


lead to an infinite value for the total energy [u{v)dv ' a state of 


equilibrium would therefore be impossible with given finite 
energy. 

The idea of a quantized exchange of energy, which occurs 
in Planck's derivation somewhat schematically and only in 
application to statistical thermodynamical consequences, was 
first seriously applied to individual atomic processes by Einstein. 
In 1905, guided by the observations of H. Hertz, Hallwachs 
and Lenard on the photo-electric effect, he enunciated the idea 
of a light quantum or photon as ‘‘ an heuristic viewpoint con- 
cerning the generation and transformation of light ’’* according 
to which not only the exchange of energy between matter and 
radiation of frequency v occurs in quanta of amount hy, but 
further, light of frequency v can exist in the ether only in quanta 
of energy hv. The decisive experiments were first performed 
by Millikan ten years later. By allowing ultra-violet or X- 
radiation of frequency v to fall on a metal plate electrons are 
released whose kinetic energy (as was already known to Lenard) 
increases with the hardness (i.e. with decrease of wave-length) 
of the incident radiation; the energy with which the electrons 
are emitted is, however, not influenced by the intensity of the 
radiation. The exact relation predicted by Einstein is 


2 
bw — P=" =e 


where — e, m and v are the charge, mass and velocity of the 
electron, respectively. The energy kv of the photon is trans- 
formed into kinetic energy of the electron, after subtracting 
from it the work P required to pull the electron out of the metal 
surface. If the potential difference between the metal surface 
and a plate placed in front of it is V’ the electron current will 


. ws h 
disappear as soon as V’ exceeds the critical value V, = = 
Millikan found that the potential at which the current vanished, 


obtained by extrapolation, was in fact exactly proportional to 


PHYSICAL FOUNDATIONS 43 


the frequency v for monochromatic light of vafious frequencies, 
and that the constant of proportionality was equal to the 
quotient of the obtained by Planck from black body radiation 
and the elementary quantum of electric charge e. The differ- 
ence of the mean energy P for two different metals is furthermore 
equal to e times their contact difference of potential. The 
value of P, or at least its order of magnitude, is therefore known, 
and we find that for X-rays of a few Angstréms wave-length 
(1A = 10-8 cm.) P is negligible in comparison with Av. The 
equation 
2 

hy = > = eV (1.3) 
governs not only the generation of secondary cathode rays by 
primary X-rays, but also the inverse process: the transformation 
at the glass wall or on the anode of the incident cathode rays 
into the impulse radiation first observed by Röntgen. If an 
electron which has run through the potential drop — V in the 
X-ray tube loses its entire energy on collision, a photon of fre- 
quency v and energy hv = eV will spring into existence. The 
electron may, however, only be slowed down; consequently 
v is only the upper limit for the frequency of the impulse radia- 
tion, which will therefore consist of a continuous spectrum with 


a sharp limit at v = K, The old classical theory of radiation 


was entirely unable to account for this most characteristic 
property of the impulse radiation. The frequency of the limit 
increases in proportion with the applied potential—and this is 
the exact formulation of the fact that “ the higher the potential, 
the harder the rays ” so familiar to every X-ray operator. 

The observed phenomena thus confirm the hypothesis that 
radiation of frequency v can be absorbed and emitted only in 
quanta of energy hv. This hypothesis will of course have further 
consequences for the theory of the structure of matter. The 
Planck oscillator will, for example, be unable to alter its energy 
continuously since it can only emit or absorb these fixed quanta 
of energy, and it will consequently spring to and fro on the rungs 
of its energy ladder, which are equally spaced at intervals hy ; 
v 1s here the frequency of the oscillator, a constant determined by 
the constitution of the oscillator. An application of the essential 
elements of this idea to actual atoms gave rise to the frequency 
rule enunciated by Niels Bohr (1913) : ~ 
=~ An atom can exist only in certain discrete stationary states 
(“ quantum states °) in which it does not radiate. Light will be 
emitted on transition from one state into another ; the energy which 
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it loses in this transition, the difference Ey — Es of tis energy in 
the two states, will be transformed into a photon of energy hv, the 
frequency v of which 1s determined by the equation 


hv = E, — Ez. (1.4) 


In this equation E,, E, may be any two of the discrete energy levels 
(E,>E,). Conversely, in absorption a photon raises the atom from 
the energy level E, to a higher E, by giving up its energy hy to the 
atom. 

According to classical electrodynamics an atom should 
continually emit radiation in consequence of the vibrations of 
its constituent electrons, and the frequencies of the emitted 
light should agree with the frequencies of the simple oscillations 
into which the motion of its electronic system can be resolved. 
But the atom will itself lose energy through this radiation, the 
motion of its electrons will thereby be modified and the fre- 
quencies will consequently be displaced. This entire point of 
view is therefore irreconcilable with one of the most fundamental 
physical facts: the existence of sharp spectral lines. On the 
other hand, Bohr’s assumption is not only in agreement with 
this fact, although it offers no such detailed picture of the 
reaction between matter and ether as the classical theory, but 
contains in addition the fundamental Rite-Rydberg combination 
principle. If we order the energy levels in an increasing series 
E< E< E <*' +, then in accordance with (1.4) each 
frequency v is the difference of two “ terms’ v, = E,/h, 


v(t —> k) = Vy — V; (i > k). 


Consequently there will occur in addition to the frequencies v(i —> k), 
v(k —> I) the frequency 

v(i —> 1) = v(i > k) + v(k > J) (1.5) 
obtained from them by addition. This combination principle is 
valid without exception in the whole of spectroscopy, in the 
optical region as well as in that of X-rays, and has proved to 
be a valuable guide in the classification of spectra; it reduces 
the complex line spectra to the simpler term spectra. Un- 
fortunately the problem is made more difficult by the fact that 
not all lines corresponding to possible transitions i —> k need 
actually occur—not every term v; need ‘‘combine”’ with a 
given term y,—for the conditions of excitation may be such 
that certain lines have zero intensity. The selection rules for 
the allowable transitions will therefore be contained in the 
rules which determine the intensities of spectral lines. The 
combination principle, or the Bohr frequency rule, determines, 
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so to speak, only the keyboard of the spectrum—which tones 
are really struck is dependent on the mode of excitation. But 
it will in general be possible under proper conditions of ex- 
citation, e.g. the influence of strong external electric fields, to 
bring out the lines which are not observed under ordinary 
conditions. 

In the “' unexcited” or normal state the atom is in the stationary 
state of lowest energy Ey, and consequently only the lines of the 
"series ” n —> 0, of frequency va — v (n = 1, 2, +--+), occur in 
absorption. The lowest of these 1 > 0 (i.e. with greatest wave- 
length), or more precisely the lowest which is not forbidden by 
the selection rules, is called the ‘‘ resonance line.” 

The simplest atom is that of hydrogen ; in it a single electron 
of charge — e revolves about a nucleus of opposite charge + e. 
The terms of the spectrum of atomic hydrogen are found by 
observation to be given by the equation 

Vn R 
Co w (1.6) 
where R = 109700 cm.~! is the Rydberg constant (spectroscopists 
are accustomed to give the wave number v/c, the reciprocal wave- 
length, instead of the frequency v). The energy levels corre- 
Rhe 


sponding to these frequency terms are En = — 7 To this 


discrete term spectrum we must add the continuous spectrum 
E20; the additive constant in the energy is so chosen that 
E = 0 separates the hyperbolic electron orbits from the elliptic. 
The Balmer series consists of the lines n > 2 with wave numbers 


This is the oldest known series formula; Balmer obtained it in 
1885 by abstraction from the first four lines of the series, called 
Hz, Ha; Hy, Hs, which lie in the visible region. The lines of 
this series converge with increasing n to a limit with wave 


number $ (wave-length = 3650A ). cRh is the work required 


4 

to ionize an H-atom in the stationary state n = 2, i.e. the work 
required to remove the electron from such an atom without 
leaving it with kinetic energy. The continuous spectrum, 
arising from transitions which ionize the atom, will join on to 
this series limit on the short wave side. We are further ac- 
quainted with the Lyman series n —> 1 which lies in the ultra- 
violet and also occurs in absorption, the Paschen series n —> 3 
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lying in the infra-red, and finally with some members of the 
Brackett (n — 4) and Pfund (n — 5) series in the far infra-red. 
In order to ionize hydrogen in the normal state an amount cRA 
of work must be done ; the corresponding “ ionization potential,’’ 
i.e. the potential difference an electron must traverse before it 
is able to ionize atomic hydrogen by means of its kinetic energy, is 


V = i = 13-53 volts. 


Bohr's frequency rule goes beyond the combination principle 
in asserting that the terms are actually energy levels, an assertion 
irrelevant to and not verifiable by spectroscopy. That this is, 
however, in fact the case is confirmed by the experiments of 
Franck and Hertz on collision phenomena.* In these experiments 
electrons are given an amount eV of kinetic energy by allowing 
them to pass through an electric field of known potential differ- 
ence — V and are then allowed to pass through a gas consisting 
of the atoms which are to be investigated with the velocity thus 
obtained, without further influence from external fields. The 
electron can give up no energy to the atom until eV is greater 
than the excitation energy E, — Ee of the resonance line; if 


Fi,—E,< eV < E — Ep 


then the electron can either suffer an “elastic collision,” in 
which case it loses no energy, or it can suffer an "inelastic 
collision,” in which case it loses an amount E, — E, to the 
atom. The electrons which have passed through the gas are 
of two kinds, those with kinetic energy eV and those with 
eV — (Ei — E). When the atoms which have been raised 
from the state 0 to the state 1 by collision with electrons fall 
back into the normal state they emit the resonance line and, 
under the above conditions, only this line. This is fully con- 
firmed by the experiment. The kinetic energy of the emerging 
electrons is measured by introducing a retarding potential V” ; 
the electrons only come through it if their energy is greater 
than eV’. In general the electrons possess a discrete “ energy 
spectrum ” after collision with an atom of the gas; the possible 
energy values are 


eV, = eV — (En — E,) 


(n = 0, 1, 2,+--, in so far as V,’ is still positive: we here dis- 
regard the possibility that a single electron may suffer more than 
one inelastic collision). On allowing the retarding potential V’ 
to decrease gradually from a value which is greater than V the 
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electron current decreases suddenly whenever V’ passes through 
one of the values V3, Vist °° 

Bohr’s frequency rule reduces the determination of spectra 
to the problem of obtaining the stationary states and the correspond- 
ing energy levels of an atom, i.e. of a mechanical system of known 
dynamical constitution. The example of the linear oscillator 
given above and the fundamental notions of the theory of 
oscillations suggest the following as a general guiding principle 
(P): the frequencies derived from the energy levels by means 
of Bohr’s frequency rule shall correspond to the frequencies of 
the simple vibrations into which the actual motion of the atomic 
constituents can be resolved in accordance with the laws of 
dynamics. Such a resolution into simple oscillations is con- 
vincingly attainable in classical mechanics only if the system 
is “ multiply ” or “ conditionally periodic,” and for this case it 
was actually found possible to sharpen the general principle (P) 
into a definite rule for quantization. In the years 1918-25 the 
application of this quantum rule yielded a great harvest of 
results, and it seemed that we were in possession of the key that 
would unlock the mysteries of atomic processes. But the wards 
did not quite fit; toward the end of this epoch its failure became 
more and more apparent and the physical theory was gradually 
reduced to a symbolic calculus of quantum numbers which had 
to be corrected each time a new fact was discovered. We do 
not wonder now that it ran such a course, but rather are surprised 
that it was as successful as it was ! 

From the beginning the quantum rules were a compromise. 
If a mechanical system of one degree of freedom undergoes a 
periodic motion the frequencies v of the simple vibrations into 
which its motion can be resolved are integral multiples of a 
fundamental frequency w. This frequency depends on the 
energy of the orbit under consideration, and this latter is re- 
stricted by the quantum rules to the discrete set £,. The 
internal frequencies of the motion are therefore given by the 
formula 


v= k- wln) (1.7) 


which depends on the two integers n and k. By the analogy 
with quantum mechanical frequencies this internal frequency 
(1.7) is to be ascribed to the jump n > (n —k). The fact that 
v depends linearly and homogeneously on the jump & is expressed 
by the “ classical combination principle ” 


v(n—> n — k) + y(n => n— l) =v(n>n—k— h (18) 
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in consequence of which frequencies with the same initial state 
n will combine. But this is not in accord with the correct 
combination principle 


vin >n — k) +r(n—k>n—k—I) =vn>+n—k — I) (1.9) 


The changes k, l in the quantum number are here the same as 
in (1.8), but the final state n — k of the first frequency coincides 
with the initial state of the second; only for quantum numbers 
n which are large compared with k and / does the classical 
principle agree asymptotically with the Ritz-Rydberg com- 
bination principle. Consequently if the general principle (P) 
is to be satisfied without compromise our mechanics must be 
altered in such a way that the false combination principle (1.8) 
is replaced by the correct one (1.9). In 1925 Heisenberg dis- 
covered a way in which such an alteration can be naturally 
accomplished; in order to do this, however, it was necessary 
to give up the picture of an atom with its electronic orbits- 
The quantities with which the Heisenberg theory deals are 
only the frequencies and intensities of radiation associated with 
transitions between the various states of the atom. 

It should be observed that the correct combination principle 
(1.9) is in one important respect simpler than the false one (1.8)- 
As the formulation 


vin? > n') + v(n' > n) = v(n"'~> n) (1.10) 


shows, the quantum numbers serve only as distinguishing marks 
or indices which do not involve a law of composition, whereas 
the classical formula requires the addition of quantum numbers, 
which are therefore numbers on a definite scale. 

Another approach to quantum mechanics was discovered 
by L. de Broglie and E. Schrédinger.* This approach seems to 
me less cogent, but it leads more quickly to the fundamental 
principles of quantum mechanics and to the most important 
consequences for experimental science. We shall therefore 
follow it, since we are more concerned in giving a short but 
comprehensive account than in giving a complete discussion of 
the physical foundations. The physical, essentially statistical, 
interpretation of the theory, with which Schrödinger has not 
been entirely in accord, is due mainly to M. Born. 


§ 2. The de Broglie Waves of a Particle 


We consider the undulatory character of light as guaranteed 
by the phenomena of diffraction and interference. Their most 
decisive feature is that with them we are dealing with the linear 
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superposition of waves with arbitrary differences of phase. From 
the mathematical standpoint, they are characterized by the fact 
that they involve addition and multiplication with complex 
numbers, and we are consequently dealing with vectors in a 
complex space. We can, in fact, consider a complex function 
y(t; xyz) employed in the description of the phenomena and 
defined over time and space as such a vector, where each space- 
time point represents one dimension of a complex vector space; 
the differential laws for such a wave function %—or for several 
such functions simultaneously, such as the components of the 
electric and magnetic field strengths—are linear and homo- 
geneous. But on the other hand the quantum phenomena 
which we discussed above speak just as plainly in favour of 
the corpuscular nature of light. The intensity of the mono- 
chromatic radiation employed in the production of the photo- 
electric effect has no influence on the velocity with which the 
electrons leave the metal; it influences only the frequency of 
this event. Even with intensities so weak that on the classical 
theory hours would be required before the electromagnetic 
energy passing through a given atom would attain to an amount 
equal to that of a photon, the effect begins immediately, the 
points at which it occurs being distributed irregularly over the 
entire metal plate. This constitutes a proof of the existence of 
photons which is no less direct than the proof that «-particles are 
of corpuscular nature by observing the scintillations caused by 
them on striking a sensitized screen. Further, if one considers 
the exchange of momentum in addition to that of energy in 
deriving the laws of black body radiation, conflict with Planck's 
hypothesis concerning energy quanta can be avoided only by 
assuming that in addition to the emission of the energy quantum 
hv a quantum hv/c of momentum is emitted in a definite direction, 
producing an equivalent reaction on the atom.* We here replace 
the continuous radiation of a spherical wave by the discontinuous 
emission of photons in definite directions which are irregularly 
distributed over the compass. 

We unite the two standpoints by retaining the linear wave 
equation, but considering the intensity i Y as the relative probability 
that the photon appears at the point (x, y, z) at time t; or, more 
precisely, that 


bb dxdydz (2.1) 


is the probability that at time tit will be found within the small 
parallelepiped with sides of length dx, dy, dz about the point 
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(x, y, 2).* But we can only expect to arrive at a rational theory 
if we deal with material particles in the same way as with photons. 
This point of view was developed in the Bose-Einstein treatment 
of an atomic gas, which paralleled that employed in the theory 
of black body radiation (“light quant gas”’).6 Schrédinger'’s 
researches took as their point of departure the Hamiltonian 
theory of mechanics, which was originally obtained by Hamilton 
himself from an analogy with geometrical optics. He argued 
that since we replace geometrical optics, with the aid of which 
interference and diffraction cannot be treated, by wave optics, 
it is reasonable to attempt the analogous transition in mechanics. 
The results amply justified the attempt. The investigations of 
Davisson and Germer, which prove the existence of interference 
in beams of electrons reflected from a crystal lattice, were already 
in progress when de Broglie published his theory. The experi- 
mental evidence that moving material particles behave in much 
the same way as a beam of light with respect to these phenomena 
is now fully established, and with no less certainty than for 
X-rays, by a series of further investigations by the same 
authors and by G. P. Thomson, F. Rupp and others.” The 
real difference between ‘‘ light-like ” and ‘' electron-like '’ beams 
lies in the fact that the particles composing the latter possess 
charge and proper mass and can consequently be deflected by 
electric and magnetic fields. 

A simple oscillation is one in which the function y, defining 
the state of the system, depends on the time in accordance with 


the law 
y(t) = a> e” (2.2) 


where a and v are independent of t. [We choose as our unit 
of angular measure that one which proves most useful in differ- 
ential calculus, for it yields the simple relation 


l de(x) _ 


for the fundamental trigonometric function e'* = e(x). The 
sum of the angles about a point is then 27; it would, admittedly, 
be more correct from the integral standpoint to take this as 1, 
but then the factor 2r would appear in the differential relation. 
v/2m is the number of oscillations in unit time; we shall not 


* Just as in the classical wave theory we have an expression for the flow 
of energy in addition to its density, so in the more refined formulation of 
quantum theory we will have an expression for the probability that the 
photon passes through a given element of surface (‘‘ probability current’’) in 
addition to one for the probability that it be found in a given element of 
volume (“probability density ”). 


~a 
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hesitate, however, to use the name ‘‘frequency’’ for v. If we 
denote Planck's constant of action by 2rh instead of h, and we shall 
throughout the present work, the fundamental formula (1.1) 
will still be valid in the new nomenclature.} In accordance with 
(2.3) the simple oscillations (2.2) are the characteristic functions 
of the linear Hermitian operator which carries ẹ over into 


hd . SD 
3 wv ; the corresponding characteristic numbers are the 
energies E = hy. If the dependence of a state of the system on 
time is described by a superposition of simple oscillations 


W(t) T ar emt -+ ao ew tad + . . E (2.4) 
the energy is capable of assuming only one of the values hy,, 
hva, - > +, and we shall take the intensity @,a, = |a,|? of the 


oscillation of frequency v, in w as the relative probability that 
the energy is observed to be hy, The relation E = hy is accord- 
ingly to be interpreted: if v is indeterminate because an entire 
spectrum of frequencies v 1s contained in the oscillatory process, then 
the energy is indeterminate to the same extent; the intensities 
with which the various simple oscillations occur in the process 
measure the probabilities of the corresponding energies. The 


h d 
operator — 7G represents the energy : 


at 
hd 

H— hk (2.5) 
in the following sense: a characteristic function of (2.5) represents 
a state in which the energy assumes a definite value E with certainty. 
This value is the corresponding characteristic number ; in an 
arbitrary state the components a of with respect to these character- 
istic functions determine the relative probabilities aa of these 
values E. 

According to the theory of relativity energy is to be con- 
sidered as the time component of a 4-vector whose spatial com- 
ponents constitute the linear momentum p = (Pe, py, pz). The 
fundamental metric invariant of the two vectors running from 


ttf 


the origin to the points (t, xy 2), (t, x’y’z’) is the scalar product 
ctt — (xx + yy’ + zg’). 
Under a Lorentz transformation, which transforms from one 
space-time co-ordinate system to another equally permissible 
one, the quantities 
ct, — x, — y, — z 

must consequently transform contragrediently to £ xyz; they 
are therefore the components of the vector associated with 


ROE}: 


om, 
sl 
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(£, x y 2) in the space which is the dual of the 4-dimensional space- 
time world. Such a dual vector is given by 


H, — Pa, — Py, — Pz; 


or, what amounts to the same thing, 
Hdt — (padx + p,dy + pdz) 


is invariant under Lorentz transformations. The same is true 
of the total differential operator 


d= dt + (Šis + Siy += ° ds) 


applied to an arbitrary function of ¢; x, v, 2. Hence the corre- 
spondence (2.5) necessarily implies the further relations 


hd h d hd 


Pes yp Pu i dy’ Pr > Fp (2.6) 
which are to be given the analogous interpretation. 


A homogeneous plane wave 
$ = as eî (— rt + ar + By +yz) (2. 7) 


is simultaneously a characteristic function of the four mutually 
commutative operators (2.5), (2.6), which has as characteristic 
numbers 


H = hy ` Da = ha, P, = h, P: = hy. (2.8) 


It represents a state in which the energy and linear momentum 
of the quantum possess these sharply defined values. 

In classical mechanics the laws governing the motion of a 
particle are known as soon as we express its energy H in terms 
of the “canonical variables” xyz, pepyp, In Newtonian 
mechanics the Hamiltonian function for a free material particle 
of mass m is 


Pet Py tpr. (2.9) 


“om 


on employing the transition scheme developed above we obtain 
the corresponding wave equation 


h oy h? p oF SS F 


(2.7) is a solution of this equation provided the values (2.8) of 
energy and linear momentum satisfy equation (2.9); in this 
sense (2.9) and (2.10) are equivalent. But the equation (2.10) 
is linear and has as its most general solution a linear super- 
position of simple waves (2.7); such a superposition corresponds 


(2.10) 
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to a state in which the energy and momentum of the particle 
assume their various permissible values “ with a certain definite 
probability.” 

The space vector (a, B, y) in (2.7) gives the direction of 
propagation of the plane wave, and the modulus of this vector 
is the wave number p (the number of waves contained in 2r 
units of length; 2m/u is the wave length à). Hence by (2.8) 


the absolute value p of the momentum is equal to hp mt 


À 
” is the phase velocity of the wave; in accordance with (2.9) or 
h 2 
” = Im 


it is hu/2m = hr/Am and depends on the wave length or frequency 
(dispersion). Since p = mv, where v is the velocity of the 
dy =% 
dp 


ae 


particle, the ‘ group velocity ” =v coincides with the 


velocity of the particle. Experiments on diffraction and inter- 
ference phenomena in electron beams, such as those performed 
by Davisson and Germer, have made it possible to test directly 
these relations set up by de Broglie. 

In relativistic mechanics we have in place of (2.9) an equation 
which states that the square of the absolute value of the energy- 
momentum 4-vector is constant and equal to m?c? : 


H? 
za (Pa t Py + Pz) = me? (2.11) 
or , 
H = ca/m*c? + (pe + pi + pi). 


For the transition to a wave equation it is of advantage to employ 
the rational form (2.11) of this expression : 


1 db mc? 
-3 S2 -+ Ais = = FrN . h. (2.12) 


Here again the group velocity is equal to the velocity v of the 
particle, but the phase velocity is found to be c?/v; the former 
is always less, the latter always more than the velocity of. light. 
In order to return from the relativistic to the ‘ ordinary ”’ 

Newtonian mechanics by passing to the limit c > œ, we must 


first replace H by mc?-+-H, i.e. must be replaced by (2 e) 


The differential equation governing light waves can be ob- 
tained from (2.11) by dropping the term on the right-hand side. 
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Hence from the corpuscular standpoint light consists of photons 
or particles of proper mass 0: 

H? 2 2 2 

e (pz + py + Pz) = 0. 

In accordance with the expression (2.1) for the probability 
density, we are to consider as the vector in unitary System-space 
describing the state of the system the function 4 in so far as it 
depends on the spatial co-ordinates xyz. The integral of (2.1) 
with respect to the spatial co-ordinates gives the probability 
that the particles will be found “ within the volume V at time 4.” 
Space and time must be separated from one another; the system 
has at each time t a definite state (xyz), which will in general 
vary with £. The operators which represent physical quantities 
must accordingly be ones which operate on an arbitrary function 
of the spatial co-ordinates. This requirement is satisfied by 
the operators (2.6) corresponding to the momentum co-ordinates, 
but not by differentiation with respect to time, which we have 
associated with the energy. We must instead consider the 
situation as described as follows: from the expression for the 
energy in terms of the canonical variables fe, Py, Da we obtain 
the operator which represents the energy and which operates 
on the function (xyz). The equation 


is then the dynamical law which determines the change in the 
state % in time. 

The separation of space and time offers certain difficulties 
to the development of quantum theory from the relativistic 
standpoint ; consequently, for the present, we base our develop- 
ment on the Newtonian mechanics. 

Our procedure must eventually be modified in another 
important respect: we have here tacitly assumed, for the sake 
of mathematical simplicity but without physical justification, 
that the wave field of a material particle is described by a scalar 
quantity y. The modification, which is required in order to 
give an adequate description of the facts of spectroscopy, will 
be made in Chap. IV. 


§ 3. Schrodinger’s Wave Equation. The Harmonic 
Oscillator 


When the particle is moving under the influence of forces 
the kinematic part (2.9) of the energy is augmented by the 
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potential energy, which usually depends on the co-ordinates 
alone and not on the momenta. We must therefore know 
which Hermitian operator acting on ẹ corresponds to the co- 
ordinate x. I assert that it is multiplication by x; this operator 
is already referred to its principal axes, its characteristic values 
are all real numbers x and finally y(x), or more precisely p(x) V dx, 
is the component of the “ vector ”’ assaciated with the character- 
istic number x (we have here ignored the other co-ordinates y, z). 
In accordance with the statistical interpretation of the relation- 
ship between physical quantities and operators, our assertion is: 


the probability that x has a value between x, and x, is | dabde ; 


this is in agreement with the expression (2.1) for the probability 
density. If V(xyz) is a function of position in the 3-dimensional 
space, e.g. the potential energy, then the physcial quantity V 
is represented by the operator 


ý > V (xyz) +p, 
for the probability that V lies between V, and V; is given by the 


integral 
| J | Pbdxdyde 


extended over that portion of space in which V, S V(xyz) SV). 

The operators corresponding to x, y, z commute with each 
other, but the operator Q corresponding to x and the operator 
P corresponding to p, do not. In fact 


S eble)] — Ll) = ole 
or Po— 0P ="1 


where the 1 on the right-hand side stands for the operator 
identity: (x) > d(x). Because of this non-commutative re- 
lation between the operators P and Q, pẹ cannot assume a definite 
value with certainty when x does, and conversely. In fact, if p, 
is known to have the value ha with certainty, then the dependence 
of % on x is given by the factor e'**; in consequence of this the 
position x of the particle is entirely indeterminate, since the 
probability yx of localization is the same for all points x. 

If V(x, y, g) is the potential energy of the field in which the 
particle moves, the total energy is 


2 2 1 pe 
H= Pet Pet Pr Tle, y, 2) (3.1) 
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We assume with Schrédinger that in spite of the fact that all 
our variables do not commute we may still apply our rules for the 
formulation of the wave equation; we thus obtain Schrédinger’s 
differential equation 


hoy k = 0. 

Op Fy hh t Ve) Y 
We understand by ‘‘ stationary ” or ‘“‘ quantum states ” & those 
in which the energy E has a definite value ; they are character- 
ized as solutions of the wave equation which satisfy in addition 
the equation [cf. (2.5)] 


On setting E = hy, such a & will have the form ew" ‘a where 
the new function denoted by # is independent of t. This function 
(xyz), which depends only on the spatial co-ordinates, satisfies 
the reduced equation 


3-A + [E — Viya = 0. 


The problem is thus reduced to finding values of E and functions 
y + 0 of position which satisfy this equation and are such that 
the integral of $y over the entire space is finite. They are the 
characteristic numbers and characteristic vectors of the Hermitian 
operator H associated with the energy (3.1) in the function space 
of all functions of position %. The characteristic numbers E 
are the possible energy levels of the particles. 

Before going any further into the interpretation of the theory 
we have developed, it will be well to convince ourselves that it 
leads to energy levels which are in agreement with the facts. 
The simplest example is that of the linear oscillator ; with it 
we are dealing with only one co-ordinate x. The potential 


energy is V(x) = se and the total energy 

— 1/?? 2 
The equation for the determination of the characteristic values 
È and the associated characteristic functions yb is 


page + (E— Se") = 0. (3.3) 
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Hermitian polynomials. The solutions of this equation are 
expressed in terms of Hermitian polynomials. The n Her- 
mitian polynomial »,(x) is defined by the equation 


d” 
Tarl 
it is of n™ degree and the highest term is exactly x". The 
Nn(x) (n = 0, 1, 2, + + +) constitute an orthogonal set of functions 
with the “ density function * e712: 
+00 


je? Na(*)Nm(x)d% = 0, m + 1; (3.5) 


ae) = (— 1) "e+ ale); (3.4) 


the functions 


Pala) = h n(x) 


are consequently orthogonal in the ordinary sense. To prove 
this we need merely to note that 


oO 
aq", _ 
(— | le *P) + n(x) ax 
becomes, on integrating n times by parts, 
+ 00 
| ee 2. dN m(%) y 
dx” 


— 00 


and the integrand vanishes for m < n. For m = n we obtain 
+ 0 
n | (eda 
~- oO 
so the equations (3.5) can be supplemented by 


~+- 00 
ya = fe-hn 2(\dx = n! Vm. 


~ O 


x 


From (3.4) we have 
a qntt , 
gu [2 a Na (x) — ( 1) 7 rie” 12) 


nti 


j d anpd\  dyd\ 
and we can consider =—; Tari 3s either +, A 5) or zi> ). Since 


— A e=) == ys eT %]2 


dx 


a 
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and 
a" Latan dr” Lat d” 
Tae P) = xT 1) + nTa 


-1 
(6 *’), 
the first of these interpretations yields the recursion formula 
In+i(*) = Xn(x) — 21) n-1(X). (3.6) 
From the second we find 
de os at 
— Wyle” 2 na(x)] = env? Nn+1(%) 
or 


Mnl) = — ni = t 1) n(%). (3.7) 


On subtracting the recursion formula (3.7) from (3.6) we find 
the simple relation 


d 
TE = Mh (3.8) 


Differentiating (3.7) and substituting (n + 1), for the derivative 
of 7,4, in accordance with (3.8), we obtain the differential equation 


The equation for ¢,(&) is consequently 


a7 -Ëp + (n+ 5 )bn = 0. (3.9) 


On going over to a new unit of length by the substitution 
x = «é, the left-hand side of (3.3) is equal to the left-hand side 
of (3.9) multiplied by 42/2ma* provided 


mt a w yal” + 5) = E, 


Ima? 4 2’ 2m? 


Let w = Va/m denote the classical frequency of the oscillator. 
The first of these conditions determines the new unit of length a: 


2 h h 
gí = 2e I= = 
2./am  2mw’ 
and the second requires that 
E = E, = hw(n + 4). (3.10) 


It is possible to show that the ¢,(&) constitute a complete ortho- 
gonal system,® and consequently there can exist no further 
characteristic numbers and functions. The oscillator possesses 
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the discrete energy levels (3.10) at intervals hw apart. That the 
lowest energy level turns out to be $hw instead of 0 is of itself of 
no significance, as we may always introduce an additive constant 
into the energy, although it is meaningful to assert that the least 
possible value of the quantity H, (3.2), is equal to thw. 
However, the wave equation not only yields the energy levels 
as characteristic values, but it also gives us information con- 
cerning the probability of localization by means of the character- 


an . . h 
istic functions. For convenience we now take «= J Img as the 


unit of length. When the oscillator is in the state described by 
the n™ energy level, the probability that the oscillating particle is 
at a distance x from its position of equilibrium is given by 
e~*"/2.y2(x). These probabilities are to be understood as 
relative, and refer to equal infinitesimal intervals about the 
points of comparison x. In particular, for the lowest energy 
level n = 0 the probability density is e~*’?; we can therefore 
no longer say that the mass-point is at rest in the position of 
equilibrium, but rather the probability of its displacement from 
this position 1s given by a Gauss error curve. The normalized 
characteristic functions of (3.3) are given by 


Halt) = = dal) 


On expressing any function (x) of position in terms of this set 


co to 
Wa) ~ Saale), a = fleibnlalde 


and the operator belonging to the energy H is, as we have already 
seen, expressed in terms of these co-ordinates Y, by 


Xn > h(n + 4) + xy. 


In order to find the operator associated with the co-ordinate x 
we must express xy,(x) linearly in terms of the characteristic 
functions themselves ; by (3.6) we have 


Xon = Pny + MP1 


whence 
n NY n- r — 
apa = Nagy tO = VIE baa + Van 


The correspondence (x) > x(x) is thus expressed in terms of 
these Fourier coefficients by 


Xn > Vn Xna tH Vn + ltn; 
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its matrix ||@nn|| contains only the elements 


In, n-1 = Vn, Gn, nt+1 = Vn + l. (3.11) 


(On returning to the original unit of length the right-hand side 
must be multiplied by the factor «.) On applying the operator 


£ to „n we obtain, in accordance with (3.8) and (3.6), 


don 
at = 4 Non — Pnat 
whence 


os = (vn Bana Vn + lny) 


The linear Hermitian correspondence associated with the mo- 


mentum p = oe is accordingly 


Xn = (— V2 Xn +Vn+1 Xn+1) , 


its matrix ||p,,,|| has as its only non-vanishing elements those 
for which m =n 1: 


h h ,—~ 
Ên n- = — z; V^, Punti= avn + 1. (3.12) 


(On returning to the original unit of length these elements are 
to be multiplied by 1/a.—Terms with the index n — 1 are to 
be omitted when n = 0; in fact, they automatically drop out 
of the above formule.) 


§ 4. Spherical Harmonics 


In order to discuss the energy levels of an electron in a 
spherically symmetric electrostatic field we must first discuss 
Spherical harmonics and their principal properties. 

1. Definition —Let r denote the distance from the origin in 
the 3-dimensional space with co-ordinates x, y, z, and let v, 6, $ 
be polar co-ordinates with polar axis along the positive 2 
direction : 

x + iy =r sin ĵet, z= r cos 8. 


On setting a homogeneous polynomial u of 6 degree in x, y, 2 
equal to r'- Y,, Y, depends only on the directional co-ordinates 
6, $ and is a function of position on the unit sphere. If u is 
a harmonic function, i.e. if it satisfies the equation Au = 0, 
Y, is said to be a surface harmonic of degree | and the harmonic 
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function u itself is said to be a spherical (or solid) harmonic of 
degree l. Since in polar co-ordinates 


_ 1 d/ du l 
Au = 33 e) + Au, 
1 {d7/. „du l du 
“= sin af sa (sia 0) T Sn 8 sm} (4.1) 
the surface harmonic Y, satisfies the differential equation 
AY, + (l+ 1)Y,= 0. (4.2) 


2. Orthogonality—On applying Green’s formula to the 
spherical harmonics u = r*Y,, v = r'Y, on the interior of the 
unit sphere, we obtain the orthogonality relations 


{Vi Vidw=0, k+l, (4.3) 


in which dw = sin 6d0d¢ is the surface element on the unit sphere. 
Since the conjugate complex Y, of a surface harmonic is also a 


surface harmonic, the first factor in (4.3) can be replaced by Y, 
3. Basis.—On writing 


E= x + 7, q = x — 1y 

the differential equation Au = 0 becomes 
du Pu a. 
~ DEN TS 95 


we see that a homogeneous polynomial w of degree l in €, y, 2 
breaks up into harmonic polynomials u™) : 


u = Zur), (m= —1,-+-+,l—1,)) 


where u™) consists of all terms in which the exponents of & and 
n have the fixed difference m. The recursion formula for the 
coefficients of u™, which is obtained from the differential 
equation Au = 0, further shows that there exists one, and to 
within a multiplicative constant only one, such harmonic u'™, 
Accordingly, there exist exactly 2/-++ 1 linearly. independent 
surface harmonics of degree l; we may take them to be the 
Y™) defined by : 


Au 


y™) = y! ° yim), 
Writing 
ul) = (x — iy)” = P= (x + iy)” Py 
and r placing 
(x + iy\(e — iy) by r— e, 
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P and P, depend only on 7? and z. Hence on taking r= 1 
we have 

yim) = eid (sin 6)—™ - P\™) (cos 8). (4.4) 
For m = — l we take P=1, and for m= +. P,=1; 
P(z) = (1 — 2*)' for this latter case. Since Y‘™) depends on 
¢ only in the factor e'™?, 


[YPV dea = 0, m +m. (4.5) 
This basis Y™), in which the z-axis occupies a preferred position, 
is accordingly unitary-orthogonal. 

4. Completeness.—That the totality of surface harmonics 
constitute a complete orthogonal system on the unit sphere can 
be proved by showing that any polynomial in x, y, z on the 
sphere can be written as a sum of surface harmonics. Now 
the general polynomial of degree / contains 

(+1 +++ 


arbitrary constants. But exactly this same number of linearly 
independent homogeneous polynomials are contained in the 


expression 

Vit Yia t: Meat (tty + suet: +], (4.6) 
for the polynomials of the form r'Y,, r'V1_,, - °° are linearly 
independent in virtue of the orthogonality of surface harmonics. 
r'Y, contains exactly 2/-+ 1 = (l + 1) + l linearly independent 
functions, and consequently (4.6) contains exactly 

(@+ 1) +0+ (0-1) +0—-H]+-- 
as asserted above. 
5. Closed expressions for the surface harmonics.—On sub- 
stituting (4.4) in (4.2) we obtain the differential equation 
d2p dP 

(1 — 2) A + 2(m — leat RU + 1) —m(m — 1)]):P=0 
for the polynomial P = P% in z = cos 0. From this equation 
we find that ae satisfies the same differential equation on re- 


placing m by m — 1; we thus obtain the recursion formula 
aP(™) 
(m—1)(¢) — “—_— 
Pim—1)(g) Ze 
and the expression 
a'-™ 


(m/o) — 
P l (2) dz'-™ 


(1 — 2°)’, 
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In particular, the “‘ zonal harmonic ” 
ay — 2%)! 
Pie) = PO) = EA. 
6. Further formule.— 
Jx¥i¥i dw = 0 (4.7) 


unless 1 — k = +4 1. For x-r®V, is a polynomial of degree 
k + 1 and may, in accordance with 4, be expanded in the form 
revs t+ Yr- t: e +). Consequently on the unit sphere 
xY r = Yri H Yeates: (4.8) 

and the only values of l Z k for which the integral (4.7) can 
have a value other than 0 is L= k+ 1. Hence our assertion 
(4.7); it also follows from the above that only the first two 
terms can appear in (4.8). 

Further, we shall also have occasion to use the differential 
expressions 


Lu = IT a) Lyu, Lm, (4.9) 
Lu = L,(Lu) + Li(Lyu) + £,(L.u) 
in terms of polar co-ordinates. On setting in 
ou ou 
y? + Tk 


the changes dx, dy, dz obtained by allowing ¢ to increase by 
dd and holding r, 0 fixed, we obtain immediately 


Ou 


Similarly, 
d .cos ĝ d 
. — ig/ © aa 
Le + ily = (sy tig 5) 
d cos 0 d 
a 7 —- ~to eet eee aes 
L, — ily =e ( z t i 534)” (4.10) 


L? = — A [eq. (4.1)]. 


§ 5. Electron in Spherically Symmetric Field. 
Directional Quantization 


Now back to physics! Consider an electron of charge — e 
revolving about a fixed nucleus of charge Ze situated at the 
origin. For Z= 1l we have the hydrogen atom, for Z = 2 
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singly ionized helium Het, for Z = 3 doubly ionized lithium 
2 

Litt, etc. The potential energy is V = — a we shall, 

however, for the present take V (r) more generally as any function 

of the radius r. The wave equation for the determination of 

the energy levels is then 


“Ab + [E — Vi) = 0. (6.1) 


On expanding in terms of surface harmonics ý becomes a sum 
of terms f,(r)Y, (J = 0, 1, 2, ++). The differential operator 
on the left-hand side of (5. 1) ‘sends the /™ term of this sum into 
Y, times 


Laa EEA + E VO). 62) 


Consequently each a term must satisfy the differential 
equation separately; we thus obtain a complete set of char- 
acteristic functions of the form 


p= filr) Y 
The factor f,(r) depending only on r must be such that (5.2) 
vanishes and frf) filr)dr converges. Denoting the char- 
acteristic numbers and characteristic functions of this differ- 
ential equation by 
En, fault) (n=0,1,2,° +>), 


Ey. is a (2l + 1)-fold energy level, as the expression fm(r)Yr 
contains 2/-+1 linearly independent characteristic functions 
associated with this single characteristic value; we may choose 
as a basis the functions 


yy = fil) yp) (m = l, r. l — l, l). 


We thus arrive at three integral quantum numbers: the 
“ yadial quantum number" n, the “‘ azimuthal quantum number ” I, 
and the “ magnetic quantum number” m. The energy level 
depends only on the first two. 

In justification of this nomenclature we determine the angular 


momentum h of the electron with components 
hLy = Ypa — Py ° > 


In quantum mechanics L,, L,, L, are the operators (4.9). 
Hence for 


yim) = falr) Y = et. (a function of r and 0) (5.3) 
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we have, in accordance with (4.10), 


L, up —= m* if, 
and for the general characteristic function 


| b= fault) Vy (5.4) 
with azimuthal quantum number / 

Lie = 1 (1 -+- 1)+u. 
Hence in the state described by (5.4) not only the energy has 


a definite value Enn but also the absolute value of the moment 
of momentum 


| 92 = 10+ 1) | (5-5) 


The significance of the azimuthal number is that it fixes this 
magnitude. It is indeed remarkable that there exist states 
l= 0, n= 0, 1, 2, * ++ with spherically symmetric character- 
istic functions ẹ% = fno(r) for which the moment of momentum 
vanishes. In the states described by (5.3) not only the energy 
and the absolute value of the moment of momentum have 
definite values, but also the z-component of the moment of mo- 
mentum assumes a definite value with certainty, for then 


| L, =m. | (5.6) 


Since a magnetic dipole moment 


2—2 eh 5 
S Spc R (5.7) 


is associated with the angular momentum Ag of the revolving 
electron (the mass of the electron being denoted by » whenever 
there is danger of confusion with the magnetic quantum number 
m), the influence of Q will be felt on subjecting the atom to a 
magnetic field. The existence of the Zeeman effect under such 
conditions can be traced to this cause. A fundamental ex- 
periment to observe the magnetic moment of the electron directly 
is due to Stern and Gerlach. Let a stream of one-electron atoms, 
which are all moving in the direction of the x-axis and are in 
the state (n, l) with energy level Ep, be subjected to an in- 
homogeneous magnetic field in the direction of the z-axis. Let 
‘the x- and y-components of the magnetic field vanish in the 
(x-2)-plane, in which the beam moves, and let the z-component 
be a function of z alone. A magnetic dipole, the 2-component 
of whose moment is s,, is then acted upon by a force a. Sz 
in the positive z-direction. In consequence of (5.6) the atomic 
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beam should be broken up into 2l + 1 smaller beams by the 
force in the z-direction, corresponding to the various values 
m =l, l— 1l, —l of the magnetic quantum number. 
On performing the experiment on silver atoms in the normal 
state two beams, corresponding to m = + 1, were observed; 
the value of the ‘‘ Bohr magneton,” the elementary magnetic 
moment corresponding to one unit of angular momentum, was 
found to agree with the value = obtained from (5.6) and (5.7). 
Why the unperturbed beam corresponding to m = 0 did not 
appear remained unexplained. l 

The older quantum theory, which employed the quantum 
number k =} + 1 with values 1, 2, - - -, allowed m to assume 
the integral values from — k to + k; it seemed plausible to 
exclude the case k = 0, although one was thereby led into 
difficulties on applying the so-called ‘ adiabatic hypothesis ” 
to the behaviour of an atom under the influence of crossed 
electric and magnetic fields. In the new quantum theory no 
ad hoc hypothesis is required for this exclusion, as Z can assume 
only the values 0, 1, 2, °°. But according to either the old 
or the present scalar wave theory there should exist an odd 
number of permissible values of m for given k or L; the exclusion 
of the case m = 0 apparently required by the Stern-Gerlach 
experiment cannot be accounted for on either theory. Nor 
can we explain the related fact that in the anomalous Zeeman 
effect m may assume either an even or an odd number of values, 
according to the nature of the atom under consideration. 
Obviously something is lacking in our present scalar wave 
theory as well as in the older formulation; we return to this 
point again in Chap. IV, §4. The older quantum theory 
described the situation met above as “ directional quantiza- 
tion’; since the absolute value of the moment of momentum 
was hk and the component along the z-axis was hm, it concluded 
that the magnetic axis of the atom could assume only positions 
described by the inclination 0 with the z-axis determined by 
the formula 


cos 0 = 7 (m = 0, +41, 42,» +2). 


Thus in the case k = 1 we should expect only three possible 
orientations for the magnetic axis: parallel and anti-parallel 


to the field, which we have taken in the direction of the z-axis, ` 


and perpendicular thereto—unless we empirically exclude this 
latter possibility m = 0 because of the Stern-Gerlach experiment, 
in which case we have but two. In either case we find ourselves 
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faced with a serious dilemma, for the direction of the z-axis is 
an arbitrary direction in space. In order to avoid this one 
then assumed that the quantization was due to the influence of 
the magnetic field, and consequently the preferred z-direction 
was interpreted physically as the direction of the magnetic field. 
But even so the difficulty is not avoided in the limiting case of 
vanishing magnetic field, for the directional quantization should 
be maintained in arbitrarily weak fields. Or stated more 
physically, the radiation mechanism required by the Stern- 
Gerlach effect for the orientation of the atoms, which were 
originally in random orientation and precessing about the 
Z-axis, requires about 108 izmes as long as the greatest time 
consistent with the observations. The stand taken by the new 
quantum theory on this point is fundamentally different. The 
possible states (n, l) of the atom are described by the functions 
# of the (21 + 1)-dimensional linear family 


+1 
ys = Ful?) Y, p È Kon ' Fal”) yy 

or by the vectors of a (2l + 1)-dimensional space with com- 
ponents Xm. The 2-component of the moment of momentum, as 
well as the component in any arbitrary direction, is capable of 
assuming only the discrete values hm (m= 1, 1—1,+-+-+, — J. 
But in a state in which the z component, for example, assumes 
the value hm with certainty there is only a certain probability 
that any other component will assume a definite one of its 
possible values 2-0, kh: (41),°+°, A> (+2). The name 
“ directional quantization ” is hardly an appropriate description 
of this situation.® 

When the electro-static central force satisfies the Coulomb law 
and originates in a nucleus of charge + Ze, the differential 
equation (5.2) for the “ radial characteristic function ” f = fmlr) 
becomes 


(2e +) f)+ am (Er + Ze?) f = 0. 


dr? Y 
The character of this equation is unchanged on going over to 
the new dependent variable v defined by rf = e% + v: 
ay dv 2m 2nZe* E +D 
gt (e+ Gr) + ae 


We choose « in such a way that the constant term in the co- 
efficient of v vanishes : 


ha? = — ImE. (5.8) 


ae aie o, -7-a a -— = Ias Iak yee aE Le 
Jl wR a mr A wt ee eT a a F. 
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We know from the general theory of linear differential equations 10 
that there exist solutions of this equation 1n the neighbourhood 
of the (regular) singular point r= 0 in the form of a power 
series 


v= È ar" 


in which the exponent p begins with a certain value po, which 
need not be an integer, and runs through the values mo, mo + 1, 
ty + 2, °°. On substituting this power series into the equa- 
tion we find the recursion formula . 


Zme? 
(ue +1) — W Djana = 2af u — Fr) (8.9) 


for the coefficients a, In order that it be satisfied for u + 1 =o 
(a. = 0, Gus. # 0) we must have 


Polko — 1) = Ul + 1). 
We thus have the two possibilities : 
fo =I+1 or p= —.. 


Considering the first possibility and taking the coefficient a,,, 
of the lowest power as unity, all remaining coefficients can be 
obtained by successive applications of the recursion formula 
(5.9), as the denominator u(u + 1) — I(l + 1) never vanishes ; 
let the solution thus obtained be denoted by v. The second 
possibility does not lead to a solution, however, as the denomi- 
nator in the recursion formula for u = l vanishes; the second 
solution of the differential equation can be obtained by quad- 
rature from the first and involves logarithmic terms. 

The power series for v breaks off if for a definite exponent 
B= po tn 


Zme? 
"e= 
or 
Zme? 
ha = hin - T+ 1) (5.10) 


In this case f is of the form 


e~% + y! - (polynomial of degree n in r); 


it is finite at r = 0 and the integral 


frfrfir)dr (5.11) 
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exists, as is to be required. The corresponding characteristic 
numbers E are the energy levels; on writing n in place of 
n--t-+ 1 and solving (5.8), (5.9) for E we find 


Z>me4 1 
Ih? nE 


The integer n, the principal or total quantum number, is 
subject to the condition 2>J. There exist no other solutions 
for which the integral (5.11) converges.’ 

The energy levels depend only on the principal quantum 
number n; the terms for which n is a fixed number and 
l= 0, l, ++», n — l coincide in a single degenerate term E, 
of multiplicity 


E, = — (5.12) 


n=l 
Z (U41) = n?. 
l=0 


This theoretical result agrees with the empirical formulæ for the 
Balmer, Paschen, Lyman, etc., series. We find, in fact, the 
expression 


, E 
for the terms measured in wave-numbers (= = 5-5). The 
Ince 2rch 


expression for the Rydberg constant R in terms of the fundamental 
constants of nature (the charge and the mass of the electron, the 
velocity of light and the elementary quantum of action) agrees 
numerically with its empirical value. All terms and therefore 
all actual line frequencies v depend on the integer Z describing 
the charge on the nucleus in such a way that Vy increases in 
Proportion with Z. Sinée the X-ray terms are due to the inner- 
most electrons, which are but slightly affected by the outer 
ones, we should expect to find that the hardest X-ray lines, 
arranged in accordance with the atomic number Z, follow this 
law. It was discovered by Moseley and gave a conclusive proof 
of the fact that on going through the elements of the periodic table 
the charge on the nucleus increases by e from element to element. 
This law uncovers with unerring certainty the holes yet re- 
maining in the system of known elements; at present we lack 
but 2 (or 3) elements in the series beginning with hydrogen, 
Z = l, and ending with uranium, Z = 92. | 

The characteristic functions associated with these energy 
levels, which determine the relative probabilities of the various 
positions of the electron, can be expressed in closed form in 
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terms of the so-called Laguerre polynomials. The character- 
istic function belonging to the normal state n = 1, l = Q, is 
spherically symmetric : * 


l 


us = VE . e7r/a ; (5.13) 
for hydrogen , 
a= pa = 0-532 A (5.14) 


(According to the older Bohr theory, a is the radius of the inner- 
most electronic orbit.) a determines the order of magnitude 
of atomic dimensions. In the normal state hydrogen possesses 
spherical symmetry (according to the scalar wave theory—but 
see Chap. IV, § 8). 

The radial characteristic functions r * fn,(r) do not, however, 
constitute a complete orthogonal system for a given / for the 
full domain which we wish to consider: in addition to the 
discrete term spectrum (5.12) we have the continuous spectrum 
covering the whole region E 20. We go no further into this 
matter. 


§ 6. Collision Phenomena 


The optical phenomena show that the quantum theory leads 
to the correct energy levels, but they do not lend themselves 
to an attempt to interpret the vector % in system space as a 
probability. Collision phenomena, which deal with the de- 
flection of electrons or «-particles under the influence of other 
material bodies, are best suited for this latter purpose. The 
fundamental experiments of Franck and Hertz, as well as those 
of Davisson and Germer, belong to this latter category. 

Neglecting the reaction of the moving particle on the per- 
turbing body, the potential energy due to this latter may be 
taken as a given function V(xyz) of position. Considering 
a one-dimensional problem, the energy of the moving particle is 
then 


l 


We can think of the curve y = V(x) as the contour of a hill 
against which the particle runs, The wave equation for a 


* The normalizing factor 1/~/' ma? is calculated from 
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state with given energy E is | 


x oe + EV] p= 00. (6.1) 


If we neglect for the moment the perturbing field V we obtain 
as solutions of (6.1) the familiar de Broglie waves: W is a linear 
combination of the waves e'#* and eiss proceeding in the positive 
and negative directions along the x-axis, the wave number « 
of which is determined by 


(ha)? = 2mE or ha =p. 


Writing 
2m 
gz (x) = U(x) 
equation (6.1) becomes 
dp 
TE + [a — Ula y = 0. (6.2) 


We now assume that as x > + œ, U(x) behaves in such a way 
+00 
that the integral flv (x) |dx converges; equation (6.2) then has 


~ OO 
one solution which behaves for x —> + œ asymptotically like 
e'**, and another, which is linearly independent of the first, 
which behaves like e~*** in the same region, 
This can most readily be seen by solving (6.2) by the method 
of successive approximations. Let 


b= to + ty t+ peters: (6.3) 
and take as the 0“ approximation the function e**: in general 
%n4, is determined in terms of p, by integrating the equation 

d 

Pott A at pap = Ul) 


Hence 
00 


Ynan(e) = — frina — £) UE) pal) de (6.4) 


x 


We restrict ourselves for the moment to a region x 2 x such 
that 


00 
~{|U(2) dx =p <l. 


me -aeee ee oM M 
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If (Yala) |Sa, for all x, the integral (6.4) converges and we have 


nia(X)| San’ SLOL : 


we can therefore take a, = 1, an, = gan. Then a, = g” or 
lwa(x)| Sg" for x 2 xo 


Consequently the series for ù% converges at least as fast as the 
geometric series with ratio g. It satisfies the integral equation 


Oo 


$2) — Yale) = — =[sinale—@)- UE we) ae (6.8) 


and is consequently a solution of (6.2). Since 


p Slt+g+g7+-: a 
(6.5) leads to the estimate 


le) — dole) S EH fEl, 


from which it follows that (x) behaves asymptotically for 
x- + œ like p(x) = e*s, Not only is p~ yo, but also 
E n He for the equation 


gives as an upper bound for the absolute value of the difference 
on the left-hand side the quantity 
(e @) 


r fio 


which approaches 0 as x > + œ. 


The solution y(x) which we have found in the region x 2 xo 
can naturally be extended over the entire real axis by analytic 
continuation. Since our considerations apply just as well for 


x —> — oo, we know that ¢(x) satisfies an asymptotic equation 
of the form 


p(x) ~ bers + b'ei for x—> — o. 
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At the same time we must also have 


TE mw ialbeise — levine), 
w(x) being a solution of the differential equation, (x) is 
also : 


TE pe UW = 0, + ft — Ula) 0. 


Multiply the first equation by %, the second by w and subtract ; 


we find 
LOL- E) = =0 


or 
dys dip 


The determinant (6.6) has the limiting value 27a for x — + oo 
and for x > — œ 


Qia(bb — b’b’), 
whence 
bb — b'b' = 1. (6.7) 


It follows from this that b + 0. On multiplying (x) by 1/b 
we have a solution % whose asymptotic behaviour is described 
by the equations 
U(x) ~ ért +. a'ei for x> — oo, 
p(x) ~ aers for x—-> + œ (6.8) 
where a = 1/b, a’ = b'/b. (6.7) is now 


lal? + |a’|? = (6.9) 


A particle of definile energy runs against the potential energy 
hill from the left, i.c. from x= — œ. Whereas in classical 
mechanics the particle certainly either gets over the hill or is thrown 
back, according to whether its initial kinetic energy 1s greater or 
less than the maximum of V(x), quantum mechanics states that 
there is a probability |a|? that it gets over and a probability |a'|? 
that it 1s thrown back. Furthermore, these probabilities are 
continuous functions of the energy of the particle; the dis- 
continuity of the classical theory is completely broken clown. 
If we perform the experiment successively with a large number 
of particles we find that they are divided into two streams, 
in accordance with (6.8.), proceeding in the positive and negative 
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directions along the x-axis; the relative intensities of these 
are given by 1 and |a’|? for «> — oo, respectively, while for 
x—> + œ there exists only the positive stream of intensity 
|a|? Equation (7.5) thus expresses the conservation of the 
number of particles and shows that we must consider the square 
|a|? of the absolute value of the amplitude a as a relative intensity 
or probability. 
If the integral 


the solution W is represented throughout the whole space by the 
formula (6.3). In perturbation theory one is usually satisfied 
with the first term #,. The theory of the familiar experiments 
of Rutherford, in which «-particles are allowed to fly in a given 
direction with given momentum into and be deflected by the 
field of an atom, has been developed by Wentzel in a similar 
manner.2 The influence of the «-particle on the atom is thereby 
neglected; on taking it into account we are led to the theory 
of the experiments of Franck and Herts, giving formule for 
the dispersed particles specified according to their various 
discrete kinetic energies and their various directions. This 
calculation has been carried through for hydrogen by Born and 
Elsasser.% A very important application of this picture of 
corpuscular waves “ seeping ’’ through a potential hill has been 
made by G. Gamow and R. W. Gurney and E. U, Condon to 
explain radioactive decay.‘ 


§ 7, The Conceptual Structure of Quantum Mechanics 


The fruitfulness of the theory has been amply established by 
the above applications and the examples given have served to 
illustrate its physical interpretation ; it now seems time to set 
forth its general abstract formulation. 

Consider a physical system of known constitution. Each 
particular state, each individual case of such a system is repre- 
sented by a vector ¢ of modulus l in a unitary system space. Each 
physical quantity associated with the system is represented by an 
Hermitian form in this space. The fundamental question which 
we put to the theory is not, as in classical physics, ‘‘ What value 
has this physical quantity in this particular case ? " but rather 
“ What are the possible values of the physical quantity A, and what 
is the probability that it assumes a definite one of these values in 
a given case?” The answer to this question is: The probability 
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that A assumes the value a. is the value Ex(t) of the characteristic 
form Ez of A associated with the value x, where the vector x repre- 
sents the case in question and the quantity A is represented by 
the Hermitian form A in the system Space. The quantity repre- 
sented by A is capable of assuming only those values « which 


are characteristic values of the form A. In accordance with the 
equations 


= Es (i), Ale) = Saka) 


the sum of the probabilities is 1 and the value A(z) of the form 
A is the mean value or expectation of the quantity A in the state y. 
Since all assertions concerning the probabilities in a given state 
g are numerically unaltered when ņ is replaced by e r, where € 
is an arbitrary complex number of modulus 1, we cannot dis- 
tinguish between these two cases. The pure case or state is 
consequently more properly represented by the ray g than by 
the vector x, and we must therefore operate in the ray field in 
system space rather than in the vector field. 

The significance of probabilities for experimental science is 
that they determine the relative frequency of occurrence in a 
series of repeated observations. According to classical physics it 
is in principle possible to create conditions under which every 
quantity associated with a given physical system assumes an 
arbitrarily sharply defined value which is exactly reproducible 
whenever these conditions are the same. Quantum physics 
denies this possibility. We illustrate this by the example of 
directional quantization. We know conditions under which we 
can guarantee with practical certainty that the atoms of a 
hydrogen gas are in the normal state. Let us therefore assume 
that we can create conditions under which we can be certain 
that the atoms under observation are in the quantum state (n, l) 
with azimuthal quantum number ]=1 and energy E. A 
certain quantity L., which can, under these conditions, assume 
only the values + 1, 0, or — 1 is associated with each direction 
z in space. Stern and Gerlach have shown us how to sharpen 
these conditions so that L, takes on a definite one of these values, 
say L,=+1. According to the theory the utmost limit of 
precision is then reached. If x is another direction in space, 
then under these conditions which determine L, and E only the 
relative probability that the quantity L, assumes any one of the 
values + 1, 0, — 1 can be given. Why is it impossible to go 
further and insure conditions under which in addition La takes 
on a definite one of the values, say 0, with certainty ? Because 
the “ measurement ” of L,, which is accomplished by separating 
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the atoms into three classes L, = + 1, 0, — 1, is only possible 
by creating conditions which destroy the homogeneity already 
existing with respect to L,. Polarization of photons is obviously 
somewhat analogous to directional quantization of atoms. The 
conditions for the production of a monochromatic beam of light 
in a definite direction determine the energy and momentum of 
the photons. To each orientation s of a Nicol prism corre- 
sponds a definite quantity A, which is capable of assuming only 
the values +1; if A,=-+ 1 the light goes through and if 
A, = — 1 it does not. With the aid of such a prism we separate 
out the photons for which A,=1 without disturbing their 
energy and momentum. The utmost limit of precision is then 
reached ; a monochromatic pencil of polarized light is the most 
homogeneous light possible. If we now place a second Nicol 
of orientation o in the path of this beam, then naturally only 
those photons which have A, = + 1 can pass through. But 
the light which we thus obtain is of the same constitution as 
if the first Nicol of orientation s were not used at all; the con- 
dition that all the photons have A, = + 1 is obviously destroyed 
by the second Nicol. 

Natural science is of a constructive character. The concepts 
with which it deals are not qualities or attributes which can 
be obtained from the objective world by direct cognition. They 
can only be determined by an indirect methodology, by observing 
their reaction with other bodies, and their implicit definition is 
consequently conditioned by definite laws of nature governing 
reactions.‘® Consider, for example, the introduction of the 
Galilean concept of mass, which essentially amounts to the 
following indirect definition: ‘‘ Every body possesses a mo- 
mentum, that is, a vector mb having the same direction as its 
velocity v; the scalar factor m is called its mass. The mo- 
mentum of a closed system is conserved, that is, the sum of the 
momenta of a number of reacting bodies is the same before 
the reaction as after it.” On applying this law to the observed 
collision phenomena data are obtainable which allow a deter- 
mination of the relative masses of the various bodies. But 
scientists have long held the opinion that such constructive 
concepts were nevertheless intrinsic attributes of the ‘‘ Ding an 
sich,” even when the manipulations necessary for their deter- 
mination were not carried out. In quantum theory we are con- 
fronted with a fundamental limitation to this metaphysical stand- 
point. 16 

We have already seen, toward the beginning of this chapter, 
that a co-ordinate x and its associated momentum p stand in 
a peculiar relationship to one another: the precise determina- 
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tion of either one of these quantities precludes the precise 
determination of the other. In the state represented by the 


* +90 
wave function p(x) [ J ù dx = 1] the mean values x = <x> and 
— 00 


Py = <p> are given by 
+00 +0 J 
E B(x) plede and |y Hax 
=m 0O —00 


No loss of generality is incurred by taking these mean values 
as zero; the first can be made to vanish by replacing x by 
x — % or P(x) by w(x + x) and the second by replacing (x) 


by e( — re ‘ p(x). The mean values (Ax)?*, (Ap)? of (x — x»)? 
(p — po)? are then given by 


+0 
(Ax)? = [x%P(x)p(x)dz, 


vee dy Tab dy 
2 — fe nika = haja Ze 
(Ap)? = refa) sade = h | = d 
—00 — CO 
From these expressions the general inequality 


Ap: Ax Z th 


can readily be obtained (I am indebted to W. Pauli for this 
remark) ; the less the uncertainty in x, the greater the un- 
certainty in p, and conversely. * 

In general the conditions under which an experiment is 
performed will not even guarantee that all the individuals con- 
stituting the system under observation are in the same “ state,” 
as represented in the quantum theory by a ray in system space. 
This is, for example, the case when we only take care that all 
the atoms are in the quantum state (n, 1) without undertaking 
to separate them, with respect to m by means of the Stern. 
Gerlach effect. In order to apply quantum mechanics it is 
therefore necessary to set up a criterion which will enable us to 
determine whether the given conditions are sufficient to insure 
such a “pure state.” We say that the conditions ©’ effect 
a greater homogeneity than the conditions © if (1) every quantity 
which has a sharp, reproducible value under @ has the same definite 


“Cf. Appendix 1 at the end of the book. 


78 QUANTUM THEORY 


value under @’ and if (2) there exists a quantity which is strictly 
determinate under @’ but not under ©. The desired criterion 
is obviously this: The conditions © guarantee a pure state if it 
is impossible to produce a further increase in homogeneity. (This 
maximum of homogeneity was obtained in classical physics 
only when all quantities associated with the system had definite 
values.) 

In the pure state represented by the vector a = (a;), a quan- 
tity Q represented by the Hermitian matrix Q = l|gixl| has the 
expectation or mean value 


<Q> = pA Aråillir 


The numbers 
Qik = aja, (7.1) 


are the components of a positive definite Hermitian form A of 
trace 1, i.e. 


|(ag)|? = Aah, : Liiki 


(Positive definite is to be understood here in the weakened 
sense A(t) 20.) It is to be noted that <Q) depends linearly 
and homogeneously on the quantity |l¢;,|| under consideration : 


Q = tr (AQ). (7.2) 


If a statistical aggregate A is created by subjecting a large number 
of individuals of the physical system under observation to the 
conditions Œ, then the mean value of a physical quantity Q 
will be given by (7.2) where A is a certain positive definite 
Hermitian form of trace 1 which is characteristic for the 
agegregate—even if the conditions © do not guarantee maximum 
homogeneity. The reason for this is that (7.2) is still correct 
if we mix statistical aggregates, each of which does possess 
maximum homogeneity, in any proportions; any statistical 
case may indeed be considered as a mixture of pure states. 
As F. v. Neumann has remarked, this formula (7.2) can be derived 
from the simple axioms !” : 

1. If P, Q are physical quantities and A a real number, then 
AP) = AP), <P + O) = <P) + QD. 

2. If the quantity Q is capable of assuming only positive 
values (i.e. if the form Q is positive definite), then <Q> = 0. 

3. If Q is a pure number, i.e. if it is independent of all 
physical conditions, then <Q> = Q. 

Assuming not only that any physical quantity Q is repre- 
sented by an Hermitian form, but also that conversely any 
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Hermitian form represents some quantity associated with the 
system, it follows from (1) that 
<Q> = 2, Arifit, 


t, 


where. the coefficients a,; are independent of Q. (We shall 
return to this assumption in Chap. IV, § 9.) The matrix 
A = |\a,,|| must be Hermitian since (Q> is always real. On 
bringing A into the normal form 2«,x,%, (2) requires for the special 
Hermitian forms of the type Q = 24,x,@, that Saq; 20 for 
arbitrary non-negative values q;; consequently a, 20 and A 
is positive definite. 

The probability that in the statistical aggregate A the quan- 
tity Q assumes the value x is 


w = tr (AE,) (7.3) 


where E, is the idempotent form associated with the character- 
istic number x. 

We can also distinguish ‘‘ pure states ” among general sta- 
tistical aggregates, ‘‘ mixed states,” by the fact that they cannot 
be obtained by mixing two or more different statistical aggregates. 
This corresponds to the theorem that an Hermitian matrix 4 of 
the form (7.1) is not expressible as the sum B ++ C of two positive 
definite Hermitian forms B and C which are not merely multiples 
of A. This can be readily proved on taking the vector a = (a,) 
as one of the co-ordinate axes in system space. The positive 
definite Hermitian forms A with unit trace, i.e. the statistical 
aggregates, constitute a convex region © in the sense that with 
A and B their “ centre of mass” AA + uB (A, p arbitrary positive 
numbers whose sum is unity) belongs also to ©. A point of © 
which cannot be considered as such a centre of mass of two 
points of © distinct from the point in question is called, following 
Minkowski, an * extreme point. !8 © is the “convex core” of 
the class © of all extreme points, i.e. it is the smallest convex 
domain which includes all the points of © We cannot dispense 
with a single extreme point of ©; if we leave out but a single 
point of © the entire convex core shrinks together. We may 
accordingly characterize the pure states as the “ extremes’ among 
all the possible statistical aggregates. 

It is often convenient to dispense with the normalization 
tr A = l; (7.3) then gives the relative rather than the absolute 
probabilities. The simplest statistical aggregate is that one 
characterized by the unit Hermitian form with matrix 1; it 
represents total ignorance. In thermo-dynamics the important 
rôle is played by the canonical aggregate A = e~¥i*®; H is here 


H 
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the Hermitian form which represents the energy, k the Boltzmann 
constant and the number @ the temperature.?® 


§ 8. The Dynamical Law. Transition Probabilities 


Having considered the general probability laws of the quantum 
theory, we now turn to the dynamical law governing the change 
in the state ¢ of a physical system during an interval dt of time. 
The dynamical law states that this change is effected by 


H.y, where A is the 


Hermitian form which represents the energy : 


the infinitesimal unitary operator — 


(8.1) 


The peculiar significance of the energy in quantum mechanics 
is due to its appearance in the dynamical law. We also consider 
this law as a fundamental axiom of quantum theory of universal 
validity. For the matrix X : 


Kir = Xiks 


which characterizes a statistical aggregate of the pure state 
described by the vector g = (x;) [cf. eq. (7.2)], we obtain the 
equation 
hax 
7a AH — HX (8.2) 
on applying (8.1) and taking into account the fact the H is 
Hermitian. This same equation also governs the change in 
time of a statistical aggregate X for a mixed state.?° 
For the integration of (8.1) it is convenient to choose as our 
co-ordinate system the characteristic vectors of H; the corre- 
sponding characteristic numbers EF, are the energy levels. We 
call this particular system the Heisenberg co-ordinate 
system, as Heisenberg tacitly employed it in his fundamental 
paper on quantum mechanics. This Heisenberg co-ordinate 
system is in general not uniquely determined; the essential 
point is the decomposition of the system space R into the 
characteristic sub-spaces R = R(E’), R” = R(E”), -+ as- 
sociated with the various characteristic numbers £’, E”, =- 
The states represented by vectors y in such a characteristic 
space are called quantum or stationary states; in them the 
energy has a sharply defined value. The cases in which H 
possesses only discrete characteristic numbers include ‘‘ con- 
ditionally periodic motion,” the only ones for which the older 
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quantum theory could be formulated. The nomenclature and 
symbolism employed in the following is adapted to discrete char- 
acteristic spectra, but this by no means precludes the possibility 
that the spectrum is entirely or partly continuous. Equation 
(8.1) becomes, on resolving it into components with respect to 
Heisenberg’s co-ordinate system, 
d 
: see + Hix, =0 


and has as solution 
Xy(t) = Xn’ e?n! (En = hvy). (8.3) 


This is an explicit formulation of the unitary transformation 
t —> g(t) = U(t)g which the state vector x undergoes in time t. 
Since |x,(t)|"is constant, the probabilities for the various energy 
values do not change in the course of time. The finite law 


X(t) = U(t)XU-X(t) (8.4) 
for the dependence of the statistical state X(t) on the time t 
is fully equivalent to the differential law (8.2). 


The mean value q = q(t) of the physical quantity represented 
by the fixed Hermitian operator Q : 


q(t) = tr [X(Z) ; Q] 


can, on taking into account the symmetry properties of the 
trace, be written also in the form 


q(t) = tr [X + Q(¢)] 
Q(t) = V-AU (A). (8.5) 


Consequently the situation can be described either by con- 
sidering Q as fixed for all time and the statistical state X(t) as 
varying with the time in accordance with the law (8.4)—and 
this is the fundamental stand taken by quantum mechanics— 
or we can take the initial state X as representing the state of 
the system for all time and allow the operator Q(t) representing 
the quantity Q to vary with time in accordance with the law 
(8.5). This latter interpretation lends itself to comparison with 
classical mechanics. (8.5) is equivalent to the differential law 


7 = HO — OH, (8.6) 


where 


82 QUANTUM THEORY 


] ) ) tant in time, i.e. the prob- 
In particular, the quantity Q is constant l } 
abilities associated with it do not change tn course of time, if the 
Hermitian form Q which represents it commutes with H. 
In Heisenberg’s co-ordinate system equation (8.5) becomes 
Emn(t) = Amn `À e — Hn — mi, (8.7) 


The matrix Q(t) is thus expressed in terms of components Poe 
forming simple oscillations with frequencies Vm — Va 2 c 
i litude i On going over from the m 
corresponding amplitude 18 mn- g ) 
to the ntt stationary state the system loses an amount h(vm — Yn 
of energy; if this energy is radiated as light, its frequency 
is given by von =, — Vp (8.8) 
Classical mechanics collects together all the transitions from 
a fixed level m to all possible levels n = 1, 2, » > + into a single 
state of motion, the motion of the system in the mtè quantum 
state, whose harmonic components have the corresponding 
transition frequencies Vm, Yma °° *. For any quantity A it 
therefore associates a constant amplitude amn with the transitiorm 
m—>n. But in classical mechanics (for systems with one degree 
of freedom) we have 
Van = k wln), k= m—n, 
instead of equation (8.8). On multiplying the two Fourier 
series A, B 
Dap « gikat and Tbr . gtkwt 
k 


we obtain the Fourier series C with coefficients 
Cr = Sarb, (r+s= k). 


Accordingly classical mechanics associates with the quantity 
C = AB the amplitudes 


Cmn = LAm, m-r' Om, m-s (7¥+s=m— n), (8.9) 
whereas quantum mechanics assigns to it the amplitudes 
Cmn = Zani bin = Lam, m-r' Omer, n- (8.10) 


The difference between these two results lies ‘in the fact that in. 
(8.9) both factors a, b have the first index m in common, whereas 
in (8.10) the first index of b is the same as the last index of a. 
This is in exact analogy with the difference between the‘ classical ** 
and the correct Ritz-Rydberg combination principle. This was 
Heisenberg’s starting-point; the correct combination principle 
indicates the pertinent.fact that the rule (8.9) for the multi- 
plication of amplitudes must be replaced by (8.10), Admittedly 
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such multiplication is not commutative, and it collects together 
amplitudes which the older model assigned to different orbits. 
We denote |amn|? as the intensity of the quantity A in the 
transition m-—> n. When multiple energy levels occur ("“ de- 
generacy ”) only the sum X |4mn|?, extended over all indices 
m for which £, = E' and all indices n for which En, = E", 
has an invariantive significance; in such a case this sum is 
taken as the intensity of A in the transition E’ > E”, If Ay 
is that portion of A in which R(E’) intersects R(E”) the sum 


defined above is the trace of A,A,. 

Consider an atom with one or more electrons and let t be 
the vector from the nucleus to a representative electron. Then 
q = er, or in case there is more than one electron the sum 
q = Jer, extended over the various electrons, is the electric 
dipole moment of the atom. In classical electrodynamics the 
intensity of the light of frequency v emitted by the atom is calculated 
from the amplitude q(v) of the harmonic components of q with 
the same frequency v in the following manner.t The rate at 
which energy flows through a surface element do at the point P, 
whose distance from the atom at O is large compared with the 
wave-length, is given by 


where q* is the component of q perpendicular to OP and dw is 


the solid angle subtended at O by do. We have further assumed 
that the wave-length under consideration is large compared with 
the radius of the atom. Since each photon of frequency v 
carries with it energy kv, we postulate that this law is to be 
taken over into quantum theory as follows : the probability 
that an atom in state n goes over into state n’ in unit time and 
emits a photon of frequency v, whose direction lies within the 
solid angle dw, is given by 


y3 
tom |?- Der ale. (8.11) 


We thus arrive at a definite rule for the calculation of the intensities 
of the lines emitted by the atom. The fact that we can now make 
such a prediction indicates a distinct superiority of the new 
theory over the old. In particular, the transition n->n' does 
not occur if the corresponding coefficient in the Hermitian form 


t By this we mean that the terms q(v)e'¥t + J(p)e-t’! occur in the harmonic 
analysis of q. 
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for q is zero. This constitutes the general selection rule. The 
connection between the state of polarization of the emitted 
light and the direction of oscillation of the electric moment 1S 
also carried over into quantum theory. But a real derivation 
of our intensity rule can naturally only be obtained by con- 
sidering the question of interaction between the atom and the 
ether; see § 13. 


Examples: 1. The Oscillator. 
The Hermitian form 


+ 00 
fæ P(x) p(x) dx, 


representing the co-ordinate x of the oscillating particle has, 
as we have already found [(3.11)], the coefficients 


_ | hn _. [hin +1) 
An, n-1 = Ima’ Qn, n+l Ona 


Gay =0 if n nli; 
(8.12% 


with respect to Heisenberg’s co-ordinate system, in which the 
energy is referred to its principal axes. We thus obtain the 


selection rule n>n + 1; the quantum number n can only change 


by + 1, the oscillator then absorbing or emitting a photon of fre- 
quency v = w and energy hw, in accordance with (3.10). The 
selection rule makes it clear why no higher harmonics are ex- 
cited in the simple oscillator. We have also found that the 
matrix || Pan'||, which represents the linear momentum in Heisen- 
berg’s co-ordinate system, is given by (3.12) 


l ihmwn l /hmw(n +1) 

Pana = — nee Pn n- = P ea] | (8.13) 
Pav = 0 for nnl 

2. Electron in spherically symmetric field. 

The result (4.7) for surface harmonics yields the selection rule 


l—>l4l (8.14) 
for the azimuthal quantum number L: for 1 = 0 only the transitior 
0— lis possible. On introducing the magnetic quantum number 
m as in §4, the characteristic functions $™ depend on the 


meridian angle ¢ about the z-axis only in the multiplicative 
factor e't = here 


x + iy =r sin ĝ-e*® g= r cos 8, 


Fecal e 
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In order to obtain the dependence of the matrices qe + igy, 
Qa — My, qz on the transition m- m’ we must evaluate the 
integral 


felag) e(— md) e(m'd) dd, 


where a= 1, — 1, 0, respectively. The integral vanishes 
unless m’ +- a =m. The only components of qs + 1g, which do 
not vanish are those corresponding to the transitions m > m — 1 
in which the magnetic quantum number decreases by 1; for 
le — Gy M>m+1; for qa m—>m. 

This last selection rule cannot be obtained from the spectra 
themselves as long as the terms corresponding to different 
values of m (|m| Sl) coincide. But these terms are broken 
up into their various components by a homogeneous magnetic 
field in the direction of the z-axis (Zeeman effect). On “ longi- 
tudinal '’ observation of the light emitted in the z-direction we 
find instead of the one line (n, l) —> (n’, V) several left- and right- 
circularly polarized components, the former of which arise from 
the transitions m —— m — 1 and the latter from m —> m + 1. 
On “ transverse” observation, e.g. along the y-axis, we find 
two transverse linearly polarized lines arising from m > m +], 
and in addition a longitudinally (i.e. along the z-axis) polarized 
line corresponding to the transition m — m. (Polarization as 
here used means the direction of oscillation of the electric dipole, 
and therefore the direction of the electric field strength.) 

In the term spectrum of the alkali elements, which is, however, 
typical in this respect, even for the more complicated spectra 
of the other elements, we distinguish between several series by 
means of the letters s, p, d, f, g, +. Each series consists of 
infinitely many terms which we number in the direction of 
increasing frequency by the integer n. It is found convenient 
to let n run from 1 on in the s-series, from 2 on in the p-series, 
from 3 in the d-series, etc. The values of the terms ns, np, 
nd, + + are then given by the “ hydrogen-like ” formula 


R 


TEF 


in which x = Kse, Kp, Ka, * * * is a correction term depending but 
slightly on n, the numerical value of which but rarely exceeds 
1/2 and is very close to 0 for high series (f,g,...). Only terms 
lying in neighbouring series combine to produce a line, i.e. an 
s-term combines only with a p-term, p only with s and d, d with 
p and f, etc. In particular, the transitions mp —> ls give rise 
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to the principal series, which also appears in absorption, nd —> 2p 
to the lines of the diffuse series, ns > 2p to the sharp series, 
and nf —> 3d to the Bergmann series?! . . 

The alkalies A are univalent, i.e. in chemical reactions only 
one electron, the valence electron, plays a rôle; the others, 
together with the nucleus, constitute an inert closed shell. It 
is therefore reasonable to assume that the optical spectra of 
the alkalies are caused by quantum jumps involving only this 
valence electron, while the core At remains in its normal state. 
We have seen above that hydrogen in the normal state is re- 
presented by a spherically symmetric wave function %; we 
therefore assume, disregarding the reaction of the valence 
electron on the core, that this feature of the core being ‘‘ closed "’ 
is to be expressed by ascribing spherical symmetry to it.* We 
have then to deal with the problem of an electron in a spherically 
symmetric field, which we have already discussed above. In 
accordance with the empirical combination principle and the 
theoretical selection rule for the azimuthal quantum number J, 
the s, p, d, f, + © + terms are to be taken as having l = 0, 1, 2, 3, 
e ++ respectively. n then runs from l+ 1 on in the series with 
azimuthal quantum number /, as in hydrogen.** 


§ 9. Perturbation Theory 


The problemi with which perturbation theory is concerned is 
the following: Let the energy H consist of two terms H=H+eW, 
the second of which, the perturbation term eW, is small compared 
with the first; this we express by the “ infinitesimal ’’ numerical 
constant €, of which powers higher than the first are to be 
neglected. Assume that the quantum problem for the “ un- 
perturbed system ’’ with energy H has already been solved, so 
that the Hermitian form H has already been brought into 
normal (diagonal) form, and let R’, R”, - - - be the character- 
istic spaces of H with characteristic numbers E’, E”, The 
problem is to find the solution of the equations for the ‘‘ per- 
turbed system ” with energy H. 

In order to illustrate the typical difference between degenerate 
and non-degenerate systems we first consider the system space as 
2- instead of œ -dimensional ; then 


_| 4, 0 
BG. ae 


* Why He and not H is the first closed atom is only to be understood as 
the result of a profound modification of wave mechanics ; see Chap. IV. 

** Concerning the introduction of the “true quantum number” for 
elements other than hydrogen, see Chap. IV, § 10. 


+ eW. 
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If E, + E, the unitary transformation which brings H into 
diagonal form differs from the identity only by terms of order £. 
Consequently the probabilities |x,|?, |x|? that in the pure state £ 
H has the values E, E, will change only by amounts of the 
same relative order €; they remain constant to the same ap- 
proximation with which sW may be neglected in comparison 
with H. But the situation is quite different for degenerate 
systems, for which E, = E, = E, for the principal axes of H 
are then indeterminate and this arbitrariness is expressed in 
the ‘‘ instability ’’ of the system under the influence of a per- 
turbation. We set up that normal co-ordinate system e,’, €z 
in which W assumes the diagonal form; the co-ordinate vectors 
are then also characteristic vectors of H, since E&E, = E, But 
these vectors can obviously differ arbitrarily from the original 
co-ordinate vectors ¢,, ¢,, whereas the energies Avi’, Av,’ can only 
differ from E by a term of order e. On returning to the original 
co-ordinate system we have 


My = Ay * e(— vr i) + Ayn" e(— vat), 
Xo = Ao, . e(— v'i) -+ Loo * e(— va t), 


where a, = (@ 41), M2 = (aig, Az) are two mutually per- 
pendicular vectors whose directions coincide with those of ¢,’, €z’. 
The probabilities for the two states €, ¢, vary periodically in 
time with the small beat frequency v} — v; (resonance between 
states €,, €). Quantum states with the same energy are therefore 
in resonance with one another. The magnitudes of the components 
of x in the characteristic spaces ft’, R”, + > +, ie. the probabilities 
for the various numerically different values of H remain ap- 
proximately constant under a small perturbation, but this is 
not the case for the absolute values |x,| of the individual com- 
ponents x, resolved along the axes of an arbitrary Heisenberg 
co-ordinate system of the unperturbed system. 

In accordance with the foregoing we can formulate the 
perturbation problem in two forms: I. Determine the change, 
due to the perturbation, in those states in which the energy 
H of the unperturbed system is determinate. This formulation 
has a sound physical interpretation if we consider the perturba- 
tion as acting during a time interval 4, t} We then find how the 
probabilities for the various quantum states change under the 
influence of the perturbation.» II. Determine the quantum - 
states and energy levels of the perturbed system, i.e. the char- 
acteristic values and characteristic spaces of H. We ask in 
particular how the terms are broken up and displaced under the 
perturbation, We consider II first. 
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We first decompose the Hermitian form W into two parts: 
W,+V. To the first belong those portions of W in which 
a characteristic space R, R”, + - - of H intersects itself, and to 
V those in which two different characteristic spaces intersect. 
If the characteristic values of H have but finite multiplicity 
the problem of bringing W’, that part of W in which W intersects 
itself, into diagonal form deals only with the space R’ of a finite 
number of dimensions. If R’ is not simply a one-dimensional 
space, the resonance phenomena mentioned above will appear. 
The co-ordinate system, consisting of characteristic vectors of 
H, is now. more precisely specified, for now W, also appears as 
a diagonal matrix; let E, be the characteristic values of the 
H + £W, =H, so obtained. The single term value E’ asso- 
‘ciated with R’ has in general been resolved into as many different 
characteristic values E, of Hy as there are dimensions in the 
sub-space ft’. 

The remainder V = ||v,|| of the matrix is such that v,,, = 0 
if the characteristic values Em, En of H are equal. The in- 
finitesimal unitary rotation : 


ôx = e Cx, C= ||Cmnll, 
of order £ transforms H into H + ôH where 
ôH = e(HC — CH) ~ «(HC — CH). 


On choosing this transformation in such a way that ôH = — eF, 
H = H, + eV goes over into Hy; this can be accomplished by 
choosing Emn = 0 if Emn = En and 


Umn 


Cmn En Ea Em 
otherwise. The characteristic values E, of Hy are therefore the 
energy levels of the perturbed system of energy H if we neglect terms 
of order e?. 

W, can be considered as the time mean of the perturbation 
terms, averaged over the motion of the unperturbed system. 
For by (8.7) the mean value of the element a,,,(t) of the matrix 
A(t), which represents an arbitrary physical quantity of the 
system, iS &mn or 0, according as v, =v, or not. In statistics 
angular brackets are often used to denote the mean value of 
a quantity ; we may therefore write 


Wy = <W)>, Hy = <H>. 
The solution of II naturally provides an answer to the 
question I. But it is more convenient to employ the method of 


variation of constanis for the calculation of the effect of the 
perturbation over a limited time interval—the smaller the 


THE PROBLEM OF SEVERAL BODIES 89 


constant £, the longer we may take this time interval to be. 
Assume that at time ¢ = 0 the system is in the quantum state 
0 and that the perturbation begins to act at this time; we ask 
for the probability that the system will be found in the state 
n at time t. That is, we seek that solution of the equations 
-imn Xn + + Z Wans Xm (n = 0, l, 2, s .) 


which reduces to 
= l, 4, = xX, =°'+ + = 0 
at time t = 0. Writing 
Xn = En . e(— Vat) 
the equations for &, are 


l. 
T 1 En = = 2 Wam&me lon Mend! ; 


for e = 0, &, = 0. Neglecting terms of order £ê, we can take 
the initial conditions 


0 == 1, éi panaia P To * -= 0 
as the 0 approximation; on substituting these values in the 
equation we obtain as the first approximation 


EW pno eiT — J 
Èn = SO, EH). 


On setting v — v, = v, the desired probability is 
21 — cos (vt) 
eal? = [Bal = (E) SS Layo) 


y? 
2[1 — cos (vt) ] 
= Ee ON Heal? (9.1) 
It is to be noted that in accordance with this result the probability 
of transition from state 0 to state n is determined by |Ho,|%. In 


the case of resonance (vy, = v) the transition probability in- 
creases at first with the square of ?: 


eal? = (¢) + [Wool 


§ 10. The Problem of Several Bodies. Product Space 


A physical system consisting of two particles of masses m, m’, 
co-ordinates xyz; x’ y'z’ and linear momenta p, p’, has as 
its Hamiltonian function 


Veya; xyz), (10.1) 
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where V is the potential energy. We assume, as in the older 
physics of central forces, that we are here dealing with an action 
at a distance so that the potential energy depends only on the 
simultaneous positions of the two particles. This assumption 
naturally breaks down when, in accordance with the theory of 
relativity, we take into account the finite velocity of propaga- 
tion of the disturbance, which requires the introduction of a 
field. The wave function % of the system will depend on all 
six co-ordinates xyz; x’ y's’ in addition to t; the operators 
corresponding to these functions in the domain of such functions 
ys are multiplication by *,°°*+; x’, +++, and a the linear 
momenta correspond the derivatives o 2; cons 


From (10.1) we then obtain the wave equation 


hoa, k h? 4, NE n 

We must ask for the probability that the one particle is to be found 
at a point P and, simultaneously, the other is to be found at a 
point P’. The probability density is accordingly to be computed 
for a 6-dimensional space with co-ordinates xyz; x’ y’ 2’. 
Indeed, the wave field is not to represent directly occurrences 
taking place in physical space, but is to determine the appear- 
ance at definite positions or with definite energies and momenta ; 
there is consequently nothing absurd in the fact that its medium 
is this abstract 6-dimensional configuration space. 

In order to be independent of the special procedure by which 
the scalar wave mechanics puts together two systems a, b to 
form a single system ¢, as suggested by this example involving 
the Hermitian forms representing the co-ordinates and momenta 
of the two systems, we must first discuss the multiplication of 
spaces from a purely mathematical standpoint. 

With each vector g = (x,) in a space R of m dimensions and 
each vector ) = (y,) in a space © of n dimensions there is 
associated a vector 3 =f X 4 with components 


Zik = XY k (10.3) 


in an m + n-dimensional space $ = R x ©, the product space. 
The components are here numbered by means of the index 
pair (ik) =l. The totality of vectors 3 =g X ) do not them- 
_ selves constitute a linear manifold, but their linear combinations 
fill the entire product space T. With the linear correspondences 
A ini and Bin G: 


’ , 
Ky = Day; My Vp = Py by, Vk 
i k 
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is associated a linear correspondence C = A x Bing: 
Lin Vy = 2 ari Dur, Xi Yrs 
or | 
zp = È Eny Cy = AyyDyy, [L= (ik), V = (tk’)]. 


Naturally, to this multiplication corresponds the law of com- 
position 
(A x B)(A, X By) = (AA, X BB), 


where A, A, are correspondences of ft on itself and B, B, are 
correspondences in ©. A co-ordinate system in R and one in 
© together determine a co-ordinate system in 2; if the co- 
ordinate system in R is subjected to the transformation A and 
that in © to the transformation B, then the co-ordinate system 
associated with them in Ẹ undergoes the transformation 4A x B. 
In accordance with the equation 


U(X) = Ae; Yr F Xi Ap, 


to the infinitesimal correspondence H in R, F in © corresponds 
the infinitesimal correspondence 


(Hx 1,)+ (1, x ¥) (10.4) 


in Ẹ, where 1,, 1, denote the unit matrices in R, ©, respectively. 
All of the foregoing is applicable to arbitrary vector spaces. 
When R and © are both unitary spaces, then & is also, for by 
(10.3) 

Seer = La Xs? LU Ky 
is an invariant if La,~,;, Livy, are; A X B is unitary if A and 
B are. 

Accordingly, two physical systems a and Ñ are compounded 
to form a total system ¢ as follows. The system space Ẹ of ¢ 
is R x ©, where R is the system space of a and © of b. Let 
the arbitrary physical quantity « in Ñ be represented by 
the Hermitian form A; on replacing all these forms A by 
A X 1,, where 1, is the unit form in an arbitrary space ©, there 
exist between these latter exactly the same relations as between 
the A, so that from the solution of a quantum problem in R 
there arises a solution for the corresponding problem in R x ©, 
but there exists no real distinction between the two. In the 
system ¢ obtained by composition we have therefore to as- 
sociate the Hermitian form A x 1, with a quantity « of a and 
1, x B with B of b, where A, B are the forms associated with 
a, Bin R, ©, respectively. The totality of quantities of the 
composite system ¢ is obtained by starting from the quantities 
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belonging to the component systems a and b and multiplying 
and adding them together in all possible ways. The quantities 
æ of a commute with the quantities £ of b, for 


(A x 1)(1, x B) = AX B= (1, X B)(A x 1). 


We refer to the content of these last two sentences when we 
say that ¢ consists of two kinematically independent parts a and b. 
The two systems are dynamically independent if the ener 
H of the composite system is the sum of the energies H(), H(2) 

of the partial systems : 


H = (HY x 1) + (1 x He). 


i dt ; 
The infinitesimal unitary correspondence A H in the total 


system space is then that one which is due to the infinitesimal 
di t , pi 
unitary correspondences 7° A), z HO” in the two original 
system spaces [(10.4)]. If H® and H") are both in diagonal 
form, then H is also, and the characteristic numbers are given 
by 
E= EP + BP or y=) +o [= (i) 


If we have a pure state for the total system which is repre- 
sented by the vector c of absolute value 1 and components 
Cim and if Q = ||q,|| is an arbitrary quantity in a, then the ex- 
pectation of Q in the pure state ¢ is 


CO) = Egirdir. 
This has the form (7.2) with 


A= ||av|| = [| Seater. 
A(r) is the Hermitian form 
a |E cistil? 
t 


in R. But we see from this that we are not dealing with a pure 
state in a, for a; will not in general have the form a,d». Con- 
ditions which insure a maximum of homogeneity within ¢ need 
not require a maximum in this respect within the partial system @. 
Furthermore: if the state of @ and the state of § are known, the 
state of ¢ is in general not uniquely specified, for a positive definite 
Hermitian form ||a,,, in| in the product space, which describes 
a statistical aggregate of states c, is not uniquely determined by 
the Hermitian forms 


24x, ky D Qik, sk! 
+ 
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to which it gives rise in the spaces R, ©. In this significant 
sense quantum theory subscribes to the view that “ the whole 
1s greater than the sum of its parts,” which has recently been 
raised to the status of a philosophical creed by the Vitalists 
and the Gestalt Psychologists. 

The kinematically independent parts into which a system 
can be resolved need not be spatially separated, nor need they 
even refer to different particles. We can, for example, resolve 
a single particle, whose physical quantities can all be expressed 
in terms of x, y, 2; pa, Py, pz, into three partial systems with 
fundamental quantities x, p,|y, py lz, pz For quantities 
which belong to different partial systems, for example a quantity 
which can be expressed in terms of x, p, alone and one which 
is in terms of y, py alone, commute with each other in the sense 
of matrix multiplication. 

In the perturbation theory we are usually concerned with a 
system which consists of two kinematically independent parts 
and which are almost dynamically independent. Disregarding 
the interaction eW for the moment, let hy, and hp, be the energy 
levels of the two parts, so that h(v, + p,) are the energy levels 
of the unperturbed total system. On writing in equation (9.1) 
s= (n, r) in place of 0 and s’ = (n’, r’) in place of n, whence 


= (vn + Pr) ~— (Vy + pr’) = Van! + Prrt 5 


Van! = Yn at, Pret = Pr — Prs 


we find as the probability that the total system goes over from 
the state s to the state s’ during time t : 


€\2 1 — cos (van + pryt a 
2(=) eee ant prr)! ° | W (nr, nr’) |2, (10.5) 


The probability that the first system will be found in the state 
n’ after time t, the total system having been in the state s = (nr) 


originally, is obtained from (10.5) by summation with respect 
to 7’. 


$ 11. Commutation Rules. Canonical Transformations 


The development of wave mechanics in §§ 1-3 went beyond 
the general scheme of §§ 7 and 8 in that it employed certain 
specific Hermitian forms to represent the co-ordinates and 
momenta of the particle. We are now interested in seeing how 
this can be formulated in an invariant manner, without recourse 
to any special co-ordinate system in system space. 

For the Hermitian forms q, p representing a rectangular 
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co-ordinate and its associated momentum we postulate the 
commutation rule 


h 


If the system has only one degree of freedom, these two quantities 
appear as canonical variables in classical mechanics. All physical 
quantities of the system are then functions of p and q ; in order 
to avoid complications we restrict ourselves to polynomials Í in 
p and q, and assume, in particular, that the Hamiltonian function 
H has this form. What are we to understand by the derivatives 
fp and f, of f with respect to p and q in this domain in which 
p and g are not commutative in multiplication ? We should 
in any case require that differentiation with respect to q should 
obey the following postulates : 

(1) pe = 9, QQ = 1; 

(2) (f+ 2) = fa + gq and (af), = & * Ja, where «is a number ; 

(3) S8) = fe '8 +h Ee 
We see immediately that these conditions uniquely determine 
the derivative of a polynomial f, unless they happen to lead to 


contradictions. But that they do not lead to contradictions 
can be seen from the fact that they are obeyed by the definition 


th: fa = fp — pf. (11.2) 
(1) follows immediately from the commutation rule (11.1), and 


the linearity (2) of the process is evident. (3) is proved by the 
formula 


et — pUe) = fep — pg) + (fp — fig 


which involves only the distributive and associative character 
of matrix multiplication. Similarly we can show that 


— th fy = fq — af. (11.2) 
The fundamental dynamical law gives us the equation (8.6) : 
haf _ 
ide T ae 


for any Hermitian form f. On applying this equation to p and g 
—which obviously suffices to establish the corresponding result 
for any polynomial f of p and g—and comparing it with the 
formule (11.2) applied to the particular function H, we are led 
to the familiar Hamiltonian equations of classical mechanics : 


dq d 
=H, 2=—H, (11.3) 


COMMUTATION RULES 95 


It 1s a universal trait of quantum theory to retain all the relations 
of classical physics; but whereas the latter interpreted these re- 
lations as conditions to which the values of physical quantities were 
subject in all individual cases, the former interprets them as con- 
ditions on the quantities themselves, or rather on the Hermitian 
matrices which represent them. This is the more significant 
formulation which the new quantum theory has given Bohr’s 
correspondence principle. 

The commutation rule (11.1) is of a rather remarkable 
nature. It is entirely impossible for matrices in a space of a 
finite number of dimensions, and it alone precludes the possi- 
bility that in an co-dimensional space q (or p) have only a discrete 
spectrum of characteristic numbers. For on referring q to its 
principal axes 


q = ||@mn||, Inn =n» Imn =O (men); p= ||Pmall, 


the left side of the commutation rule has the components 
Pmn({n — Ym); hence the main diagonal consists of nothing 
but zeros! The question arises as to whether it can be con- 
cluded from (11.1) alone that the forms representing q and p 
can always be given the form 


+ 00 “+ 00 


l x Ble) w(x) dx, | Bix) tH dx 


— OO — 00 

for an arbitrary vector % with components (x) on employing 
an appropriate co-ordinate system in system space. We shall 
see in Chap. IV, § 15, that, on introducing a certain irreducibility 
condition, this is in fact the case. 

On taking into account the three space co-ordinates g, and 
their associated linear momenta p, («= 1, 2, 3), we have in 
place of the one commutation rule (11.1) the following : 8 


Pape — Peba = 9, Gade — 4p9a = O for all a, B; 
(11.4) 


h l == 
Pate ate = Fhen amdo (ESB. 


The same commutation rules apply to the case in which we have 
several particles, the only difference being that then « runs 
through 6, 9, + + + values, according to the number of particles, 
instead of 3. These commutation rules are the necessary and 
sufficient condition that the dynamical law, which governs the 
time rate of change of the state vector x in system space, leads 
to the Hamiltonian equations for the ‘ canonical variables ” 
Ja, Pa representing the co-ordinates and associated momenta of 
the various particles composing the physical system—whatever 
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the dependence of the Hamiltonian function H on these quantities 
may be. 


In classical mechanics the Hamiltonian equations are invariant 
with respect to canonical transformations.“ In a system of 
f degrees of freedom the transition from a set of variables g,, Pæ, 
describing the state to a set qu p, («=1,2,:°+:+, flisa 
canonical transformation if the difference 


Spadla — HP aGa (11.5) 


is a total differential. If, for example, the g, are subjected to 
a transformation 


gs = Pella te Y) 


among themselves, the p, must transform as the components 
of a “ covariant vector’’ in g-space in order that the whole be 
a canonical transformation (“ extended point transformation ’”) : 
, dps 
Pa Fo 2 2g, Pe 
Perhaps the simplest canonical transformation is that in which 
the rôles of q and p are interchanged : 


pa = — la Gu = Ppa 


The canonical transformations constitute a group [cf. ITI, § 1]. For 
the identity, i.e. the transition from (p, q) to (p, q), is a canonical 
transformation; the inverse (p’, g) —> (p, q) of a canonical 
transformation (p, q) —> (p’, g’) is also canonical; and from the 
canonical transformations (p, q) —> (p’, g), (p', g) —> (p”, q") 
it follows that the resultant transformation (p, q) —> (p”, g”) 
is also canonical, for if 


Spadqa — DPlde Ledge — S padgy 
are total differentials their sum 


2 p 21 in PA p oY 


is also. 

An infinitesimal canonical transformation is one in which 
b’, 7 differ infinitely little from p, q. We can consider it as 
an infinitesimal deformation of the 2f-dimensional (p, q)-space 
which takes place in the infinitesimal time interval e = ôt, We 
introduce the components èp, ôq of the displacement vector by 
means of the equations 


Pa — Pa = 8° 8h, Ga — Gy = ESQ, ° 
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Since (11.5) must be a total differential, 
XP 24g, + Og, dp, = dT (11.6) 


must also; in our case T must differ only infinitesimally from 
Poe. We may therefore write 


T= Dx la — eS ; 


te 


considering S as a function of p, and g’, we have, in accordance 


with (11.6), 
td O dS , oS 
Pa = Pa — EF Ja = du — ESD, 
or 
dS dS 
Ò S me n cet ate 
Pa da oq, dba (11.7) 


Since we may legitimately neglect terms of order e?, we may 
identify g, with q, on the right-hand side of these equations. 
We call S the generating function of the infinitesimal canonical 
iransformation. 

In accordance with the Hamiltonian equations, the state 
of a system, represented by a point (p, q) in (p, ¢)-space, goes 
over into a state (b + dp, q + dq) during time di. If we follow 
this transition for æll possible initial states (p, q) we obtain an 
infinitesimal deformation of the space whose points represent 
the state of the system. The Hamiltonian equations assert that 
thts deformation ts an infinitesimal canonical transformation with 
generating function H-dt. It follows from this without any 
calculation that these equations have a significance which is 
independent of any particular choice of canonical variables. 

Now in quantum theory the Hamiltonian equations (11.3) 
assert that the state vector z in system space undergoes the 
infinitesimal unitary rotation 

dt = — ta - Hy, (8.1) 
so the infinitesimal canonical transformation of the quantities 
p, q is here obtained by subjecting the argument g in the Her- 
mitian forms representing them to the infinitesimal rotation 


e = 5: St 


We find that the increments of the quantities Pw, Ja are in fact 


E . Òga = (Sau — qa) E. dp, = “(SP — Pw), 
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and, in virtue of the commutation relations (11.4), this agrees 
exactly with (11.7). On generating a finite canonical trans- 
formation by the successive application of an infinity of in- 
finitesimal ones we arrive at the result that the unitary corre- 
spondences of system space on itself in quantum theory : 
g = UE 
correspond to the canonical transformations of classical mechanics ; 
more precisely, only those for which the matrix U is expressible 
in terms of the matrices p, q, but we may for the present pass 
over the question as to whether every matrix U can be obtained, 
or at lcast arbitrarily closely approximated, in this way. Since 
the commutation rules (11.4) remain unchanged under rotations 
of the normal co-ordinate system, they are valid for an arbitrary 
set of canonical variables. This is also evident from the fact 
that they are the conditions that the dynamical law (8.1) lead 
' to the Hamiltonian equations 
Mees Cleric (11.3) 
dt dP dt dla 

The general procedure for the quantum mechanical treat- 
ment of a physical system suffers from the disagreeable fact 
that the expression for the energy in terms of the canonical 
variables must be taken from the classical model, and in ad- 
dition the transition to quantum mechanics is even then not 
unique, for the model offers no means of telling whether a 
monomial such as p*g is to be interpreted as p24, pgp, qp? or 
a linear combination of all three [cf. IV, § 14]. The provisional 
character of such a procedure is clear, but the results so far ob- 
tained seem to justify the hope that the path we have entered 
upon will lead to a unique formulation of the laws governing 
the actual physical phenomena. We need then concern our- 
selves longer with the general mechanical scheme. 


§ 12. Motion of a Particle in an Electro-magnetic 
Field. Zeeman Effect and Stark Effect 


Let the spatial co-ordinates xyz now be denoted by x; Xz Xs 
and the time t by x. Ifd@is the scalar and c Y the vector potential 
of the electro-magnetic field, then in the theory of relativity 


(— 4, Az, Ay, We) = (bo, br, Pa, $s) 


are the components of a vector in the space dual to the 4-di- 
mensional world. Let 
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Fio Fæ, Fso are the components of the electric field strength ©, 
c(Fes, M1, F12) the components of the magnetic field strength . 
Denoting the components of the velocity of a particle by 
V1, Ug, Us, its proper time is 
ds = Vdt? — (dx? + dx? + dx?) |c? 

= dtV1 — vj (v = vi + us + v2), 


, AXy . , 
With the world vector u* = qs $S associated the dual u, with 
components s 
Ur = u" (r= 1l, 2, 3), u = — cu 


The invariant equations of motion for a particle of mass m and 
charge — e are 


A(muy) _ 3 8 
ds = e Z Mapu 
or 
d(mu;) _ 3 O 
d T e( Fi +E Fieve) (t = 1, 2, 3). (12.1) 
The right-hand side is in fact the ponderomotive force 
l 
— e(¢ + [08] ). 
These equations arise from the Hamiltonian function 
— 
H = eha + calme uA (pi + ed,)?, (12.2) 


in which ¥1X2%3; PıPaps are the canonical variables. In fact, 
the Hamiltonian equations 
dx; IH clp; -+ e¢;) 
Vi EE memen ee Sern 
yield 
Di + eb; = mu; ; 
in the remaining equations 
dp, dH of 2 + sfe Pee 
dX; k=l Ox; VA 


the left-hand side is 
— d(mu;) — „P: + : dpi . on 


at WX kml OX, 
But this is the desired equation (12.1): 
, 3 , 
dima) _ ef (h 28) 5 E (2b _ i)o). 


dt OX; 0% k=} ox; XK 


100 QUANTUM THEORY 


The negative energy — H is the time component p, of the dual 
vector whose space components are the components of linear 


momentum p = (pı, ps, ps), so the equation (12.2) can be written 
in the rational form 


3 , 
YP + egy)? -2 (pi + ed;)? — m2c2. 


From this we obtain the simple rule: The influence of an electro- 
magnetic field on a particle of charge — e can be expressed by re- 
placing Da by Pa + eb, in the equations of motion for a free particle. 

On going over to quantum theory p, becomes the operator 


h d 


7 on and is contragradient to the 4-dimensional displacement 
1 OX, 


dx,, as is seen from the equation 
— p% 
dy = È A 
Our rule is now: On introducing a field of potential ¢, 


> d te 
T must be replaced by va + 7 Pa (12.3) 


in the wave equation of the particle. Only $y has a simple physical 
significance; it is therefore to be assumed that the laws which 
govern # remain invariant on replacing % by et- p, where A is 
any real function of position in space-time. On the other hand, 
in the classical theory of the electro-magnetic field only the 
field strengths, and. not the potentials, have an objective signifi- 


cance, i.e. the laws are invariant on replacing ¢, by ¢, — w 

a 
where p is also an arbitrary function of the x, On examining 
our wave equation for these invariantive properties we find 
that it is not invariant under each of them separately, but that 
there must exist a certain relation between A and u. The field 
equations for the potentials p and d of the material and electro- 
magnetic waves are invariant under the simultaneous replacement 


of 
ý by ett ah and Pa by fa — =- — 


here À is an arbitrary function of the space-time co-ordinates. 
This “ principle of gauge invariance” is quite analogous to that 
previously set up by the author, on speculative grounds, in 
order to arrive at a unified theory of gravitation and electricity. ?5 
But I now believe that this gauge invariance does not tie to- 
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gether electricity and gravitation, but rather electricity and 
matter in the manner described above. We shall discuss this 
principle more thoroughly in Chap. IV; its significance and 
its interpretation will then be more apparent. 

On passing to the limit c-> oo in (12.2), after separating 
out the factor mc*, we return to ordinary mechanics: 


1 
H = ehi + Saga Pi + Gi)”. 


On neglecting terms which are quadratic in the ¢;, we find, in 
addition to the kinetic energy Xp?/2m, the potential 
i 


V = — ef + = (pM). (12.4) 


We have already made use of the first part, that due to the 
electric field, in § 5. If we have, in addition to the field originat- 
ing in the nucleus, a homogeneous electro-static field in the 


direction of the z-axis and of strength F, for which dé = — F-g, 
it adds the perturbation term 
W = eh +2 


to the energy. A homogeneous static magnetic field Q is 
obtained from the vector potential c Y = 5 (Ot), r= (x, y, z); 
this adds to the energy the perturbation term 


5 (PIG) = z4 (iplġ), 


— Ine 


(12.5) 


Zeeman Effect.—If the homogeneous magnetic field strength, 
of magnitude |Q|, is in the direction of the z-axis, the per- 
turbation term 1s 


W = ho: Lẹ o= a (12.6) 


On choosing the characteristic functions { as our co-ordinate 
system in the system space of the functions %, W, as well as 


the energy of the unperturbed atom, is in diagonal form; in 
the state defined by nl, m it has the value 


ho: m. (12.7) 
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The components (nl, m) > (nT, m’), consistent with the selection 
rule for m, into which the line with frequency v = 7 (En — En) 


is broken up give rise to but three lines: one corresponding to 
all the transitions m—>m, which 1s linearly polarized in the 
direction of the z-axis and is undisplaced - one which is circularly 
polarized perpendicular to the z-axis, the frequency v of which 
is displaced by +o (m—> m — 1) ; and one which is circularly 
polarized in the opposite sense, with frequency v — o instead 
of y (m->m-+1). This normal Zeeman effect is found only 
in the so-called singlet lines. l 

Stark Effect.—In accordance with the general perturbation 
theory, the displacement and resolution of terms in the presence 
of a homogeneous electric field is determined, to terms of first 
order, by the matrix 


ef + <27. 


In consequence of the selection rule) >/ + 1, <2) = 0, unless 
accidentally all energy levels whose azimuthal quantum numbers 
differ by 1 coincide. Ignoring this exceptional case, we should 
expect to find no 1 order perturbation effect increasing linearly 
with the field strength F (linear Stark effect), but only a quadratic 
effect, which is much smaller. This is in agreement with the 
experimental data on alkali atoms. Hydrogen is, however, 
degenerate, since for it energy levels with the same principal 
quantum number n and = 0, 1, >+, 2—1 coincide. The 
calculations for this case have been carried out by Schridinger 
and compared with experiment.” 


§ 13. Atom in Interaction with Radiation 


Following Feans, black body radiation is mathematically 
equivalent to a system of infinitely many oscillators. Maxwell’s 
equations for the free ether are 


div ģ = 0, cul 6+ +22 — 0, 


div © = 0, STEHE 


In order to simplify the relations, we assume that the walls of 
the radiation cavity of volume V are reflecting; then © is 
perpendicular to the walls at the boundaries of the cavity. 
Since the black body is at rest it is of no particular advantage 
to carry through the calculation in a relativistically invariant 
manner; we may therefore normalize the vector potential 
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cÙ in such a way that the scalar potential vanishes. We then 
have € = — = and the equations in the first row are satisfied 


by H = c+curl N; the equations in the second row become 
1A 

On the boundary % is normal to the walls. Let the characteristic 

numbers and characteristic functions of the equations 


2 
AN +a = 0, div A= 0, 
with the boundary condition that YÑ is there normal, be denoted 
by 
V == Px (= 0), Wa [x= 1, 2, 3, T sg, 
normalized in accordance with 
f LAAI = 4rôap. 


y 
On setting 


A = X g A, 


where the coefficients g* depend on time but not on position, we 
find for them the equations 


d?o” 
a + pig = 0. 


X 
Introducing i p* in addition to the q*, this equation is 


that for an oscillator with Hamiltonian function 
l a 1 a 
H, = 3(p°)? + pag"); 
we readily find on applying 
E — SpA, H= ce g cun Aa 


that the energy of the radiation field is in fact given by 


H = 2 (E+ Oa = SHa; 
Sar o 
V 
with this we have proved the theorem due to Jeans. For high 
frequencies p there are approximately 
V p*dp 
n?e? 


(13.1) 
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modes of oscillation in the frequency interval p, p + dp.7 We 
are interested above all in the limiting case of an infinitely large 
cavity ; the spectrum then becomes continuous and our formula 
for the density of frequencies becomes exact. 

On quantizing this mechanical system of infinitely many 
oscillators 8 in accordance with the theory of the oscillator (§ 3) 
and the process of composition (§ 10—but cf. remark on p. 109), 
we find as possible quantum states s, each of which is characterized 
by the fact that in it there is associated with each index « an 
integer na 20. In this quantum state 


A, = hpa( ma + 5) 


or, on choosing the additive constant in the energy in such a 
way that the lowest energy value which the black body radiation 
is capable of assuming is 0, 


Ha = Ne * h Pa, H = Fns’ hpa 
In the language of photons this means that when the cavity 


is in the state s it contains ną photons of each kind « The 
matrix element 


qasr [s = m, %,° "t Aat, s' = Ny, Me, ° a) na, ° ] 
vanishes unless all the equations 
ni = M, My = M2, Mg = M," 
hold with the exception of n, = nw, which is to be replaced by 
Nn =n, +1 or ni=n, — 1. 
In the first case we have, by eq. (8.12), 


ev = Ni Mre t1) (Emission), (13.2) 
Pa 
and in the second 
Us = ae (Absorption), (13:2) 
Pa 


The first transition s —> s’ consists in a photon of kind « springing 
into being, the second in the disappearance of one such photon. 
It follows from the above that in a transition for which q%, + 0 
all other gł, must vanish. 

Let an atom with fixed nucleus and electric dipole moment q 
interact with the radiation field. Differentiate the quantum 
states of the atom from one another by means of the index n 
and denote the corresponding energies by hv, ; then q = ||qnn'||. 
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A quantum state of the total system consisting of both atom 
and radiation is characterized by the quantum numbers 


mM, My, Na,” © *) Ngy?” * 

The effect of the radiation on the atom is, in accordance with 
eq. (12.4) of the preceding paragraph, given to a first approxima- 
tion by the perturbation term 
: eW = (qù). 

It can be shown that the addition of such a term to the 
Hamiltonian function of the total system will, according to 
classical theory, not only indicate an influence exerted on the 
atom by the radiation field, but will also modify the equations 
of Maxwell in a way which indicates that the motion of the 
electrons in the atom affects the radiation field. The per- 
turbation term will accordingly call forth emission as well as 
absorption. To a sufficient approximation we may take for A 
its value at the point occupied by the nucleus, provided we restrict 
ourselves to radiation whose wave-length 1s large compared with 
the dimensions of the atom. We now have 


eW = EGA). (13.3) 


From this it follows than an element €* Wns, nor can only differ 
from 0 if s and s’ are such that all n = ng with the exception 
of a single one ni, which must equal n, +1. Then only the 
at term contributes to the sum (13.3), and we have 


EW ns nist (Gane N) . Grete . (13.4) 


Bohr’s frequency condition, which asserts that the emission or 
absorption of a photon in state « with energy hp, is associated 
with a quantum jump of the atom in which an amount 
+ hv, — vq’) = hp, of energy is lost or won, need by no means 
be satisfied here. The finite cavity has its own frequencies p,, 
and may therefore be in no position to take up the frequencies 
associated with the quantum jumps of the atom. This is true 
in principle, but as a matter of fact, as we shall see, Bohr’s 
frequency condition is fulfilled to a very close approximation in 
the overwhelming majority of all transitions ; and this is more 
and more the case the larger the cavity is. 
Let the atom be in the state and the radiation in the 
state s = {n,}. We set | 
Dh ny Pa = V > U(pi)dp, (13.5) 
where the sum on the left is to be extended over those indices « 


for which p, lies between p and p + dp; hence U(p)dp is the 
energy density of the radiation contained in the frequency 
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range p, p+ 4p. In accordance with (10.5), the probability 
that the atom will find itself in the state w’ after time ¢ is given 
51608 n+ pee) 
— COS (Van! Psa’) y 
h? (Van + Psa’)? 7 | EW na, nie! (13.6) 

The contribution to this sum due to the cases in which a photon 
is emitted is, in accordance with equations (13.2), (13.4), given 
by 

2 1 — cos (Mant — Palt  h(na + 1) 

Fy es oe Pal yn | (Gan Wa) |%, (13°86) 
and that for absorption by 


2 ,1— cos (Van + Paji htt pe ) a 
h? 2 (Van! + Pa)? 2Px (inn 2.) (13.64) 

Consider first the case in which the term level v,, is higher than 
Vat Van! = Vn — Vw = — v is then negative. We now collect 
together all those terms « in the sum (13-6,) for which p, lies 
between p and p + dp. Since the position of the atom is not 
exactly fixed—even in consequence of the variations caused by 
the emission of photons—we may, for small wave-lengths, 
replace WZ by its mean value 4r/V as given by the normalizing 


equation [uzay = ár, and we may also assume that all 


directions are equally probable for W, The square |(W, q)|* of 
the scalar product of Y with a fixed vector q has then the mean 


value a q|2. (13.64) then becomes 


1 — cos (p — v)t 4a |Gnn|*, Zh na pa 
(p — v)? 3 y 2p? 
On introducing (13.5) the sum (13.6,) may, to a good approxima- 
tion, be replaced by the integral 
tor Gault f 1 = eos (p = vÀ, Ul 
3 h [p — v) p 
Essentially the only elements which contribute to the value of 
this integral, for a time ¢ large in comparison with the duration 1/v 
of an oscillation, are those for which plies near tov. On developing 


Ulp) _ Ul 
Up) Us... 
in powers of p — v, the first term in the expansion contributes 
+ 0 
U(v) l — cos x U (v) 
S a | a m ot (13.7) 


— © 
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to the integral; all others are to be neglected. Similarly the 
entire amount (13.6,) due to emission is negligible, for its de- 
nominator (p + v)? vanishes nowhere. This means that the 
transition is almost invariably associated with the absorption of 
a photon whose frequency lies very close to v. The probability 
that the atom will appear in the higher state n’ after lapse of 
time ¢ increases in proportion with t; the factor 

An? Ul(v) |. drr? 

3) a ° lnn]? -= 3h ' U(r) |dan'|? 


is the probability that the transition n > n' take place in unit time. 

This formula was obtained for the case in which the state 
n’ possessed a higher energy level than n. In the reverse case 
only the sum (13.6,) due to emissions contributes an appreciable 
amount. We now put vay = Vn — Yw = v and obtain the same 
formula with this difference: in place of ną we now have n, + 1, 
or in place of the sum (13.5) the sum 


D(a + Lpa = Lhnapa + Lipa 


The first is V - U(p)dp, and we denote the second by V + u(p)dp. 
This latter is equal to (hp) times the number of modes of vibra- 


tion of the cavity within the frequency interval p, p + dp; hence 
by (13.1) 


hv’ 


°C 


hp'd 
V - ulp)dp = VF, 


rapranta 


The probability that the atom drop from state n into the lower state 
n'in unit time is given by 


4r? 


572 *(U(v) + u(v)]|danl. 


The additional term u(v) is characteristic for spontaneous 
emission. When the radiation is not enclosed in a black 
body, i.e. when there is no radiation density U(v), the proba- 
bility that the atom drop from the state n to the lower state n’ 
in unit time, emitting thereby a photon whose frequency lies 
in the immediate neighbourhood of v = vya — vy, 1S 

4y3 
37a ` [annl 


This agrees with the formula obtained by integrating (8.11) 
over all directions. The probability that the atom jump from 
the level n into a higher level 1’ (vw > Vn) under these same 
conditions is zero. 
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In the energy field of the black body radiation we find not 
only absorption, but also ‘‘ stimulated emission,” both of which 
are proportional to the energy density U(v). On setting 


Arr? 
Ann! = 372 Gan’ |? ) (13.8) 


the probability for a jump from state n to a higher state n’ in 
unit time is 


Wan = Ann * U(r) | (v = Var — va), (13.9) 


and the probability for the inverse jump, the drop from n’ 
to n, is 


Wan = Anm{U(v) + u(r)] | l (13.9) 
Since ||qnq/|| is an Hermitian matrix, 
[Aan = An'n]. (13.10) 


If there are a number of atoms in the radiation field and the 
whole system is in a steady state, then on the average as many 
atoms must make the jump n —> n’ in unit time as make the 
inverse jump n’->n. On denoting the number of atoms in 
the state n by N,, these considerations are expressed in the 
condition 


Aan? N,U(v) = Anm Na[U(v) + u(v)) 


or n 6) 
n u(y 


The probability coefficients A, = A, have entirely dis- 
appeared—or rather, almost entirely, for the equation is valid 
only under the assumption that Anw +0 or Gan +0, i.e. 
the transition nn’ is not to be forbidden by the selection 
rules. But for such a system in thermal equilibrium N, must, 
as shown by Boltzmann, be proportional to 


e-Enlkt — givið 
? 


where @ is the temperature and k the Boltzmann constant. 
Equation (13.11) then becomes 


ehon = vnd g u(v) 


(v) 
or the Planck radiation formula: 
u 
ae wl 
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this formula is valid for all frequencies v whose energies can be 
exchanged by the absorbing and emitting atoms in accordance 
with Bohr’s frequency condition.”® 

We have thus finally returned to the historical origin of the 
quantum theory. We must now add three remarks concerning 
this treatment, due to Dirac, of energy exchange between matter 
and radiation. In the first place, it is able to explain the fact 
that the spectral lines are not sharp, but possess a natural breadth.*° 
Secondly, we must inquire what causes this difference between 
absorption and emission, processes which are transformed into 
each other on changing the direction of time. Indeed, the 
fundamental mechanical and field laws are invariant under the 
transformation t-> — t! The answer is that thts difference is 
due to the preferential direction in time involved in the application 
of the theory of probability ; we assume a fixed initial state and 
calculate, with the aid of transition probabilities, the distribu- 
tion over the various states at a later time, not the distribution 
which would result from the equations for an earlier time. If 
no assumption is made concerning this preferential direction, 
t should be replaced by |t| in (13-7). And finally, the fact that 
we have here treated Maxwell’s equations as classical equations 
of motion, and as such have subjected them to the process of 
quantization, may give rise to serious doubts—for in our general 
formulation Maxwell’s equations are already the quantum- 
theoretic wave equations for the photon! But we shall see 
in Chap. IV, § 11, that this method is in fact the correct one 
to employ in order to go from one corpuscle to an indefinite 
number of corpuscles. For since the number of photons must 
remain indefinite—as a photon can, in contrast to an electron, 
spring into being or disappear—the method of composition 
described in § 10 is not applicable to them. 


CHAPTER III 


GROUPS AND THEIR REPRESENTATIONS 


§1. Transformation Groups 


HE concept of a group, one of the oldest and most 
profound of mathematical concepts, was obtained by 


abstraction from that of a group of transformations. 

A point-field, a domain of elements which we call points, 
on which the transformations opcrate, underlies the trans- 
formations. This point-field may be either the totality of a 
finite number of individually exhibited elements or an infinite 
set, in particular a continuum such as space or time. A 
mapping or correspondence S of the point-field on itself is 
determined by a law which associates with each point p of the 
field a point p’ as image: p— p’ = Sp; two correspondences 
Sp and Tp are identical if for all points p the two image 
points Sp and Tp coincide. If the point-field contains a finite 
number of elements the correspondence S can be defined by 
giving explicitly the image point for each point p; for infinite 
sets, however, the association is only possible by giving the 
law of the function S. 

Among such correspondences there is a particular one which 
associates with each point p the point p itself: pop; it is 
called the identity I. Two correspondences can be applied 
successively : if the first sends the arbitrary point p into p’ = Sp, 
the second p’ into p” = Tp’, then the correspondence resulting 
from the composition of the two is defined by the association 
p — p” = T(Sp) and is denoted by TS (read from right to left !). 
The resultant correspondence depends on the order of the two 
factors S and T. In order that composition be possible it is 
essential that the correspondences are ones which map the 
point-field on itself, and not on another point-field. 

We shall restrict ourselves to one-to-one correspondences S: 
the image points p’ = Sp associated with p shall always be 
distinct, and each given point p’ shall appear as the image of 
one (and only one) of the points p. Consequently such a one-to- 
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one correspondence S: p— p’ determines a second, the inverse 
“1: p’— p of S, which just cancels it: 


Ss" 9p) = TP S(S'p’) =p or 

SS = I, SS= I, 
The inverse of S7? is again S and the identity J is its own inverse. 
The resultant T.S of two one-to-one correspondences S, T is 
itself one-to-one, and its inverse is (TS) = $-!7-! —for 
on inverting the correspondences p —> p'—> p” there results 
p” — p’—> p. Henceforth we shall consider only those corre- 
spondences, also called transformations or substitutions, which 
are one-to-one. In this domain we have, in accordance with 
what has been said, the two fundamental operations of inversion 
and composition. 

Examples.—1. Let the point-field consist of n elements 

exhibited individually ; bring them into a particular order by 
numbering them with the integers 


L2-+sn. (1.1) 


This numbering consists in a one-to-one reciprocal relation 
between the elements of the point-field and the integers or 
possible “ positions ’’ g in the series (1.1). A permutation con- 
sists in the transition from one such arrangement to another. 
If we wish to operate in space we may think of the positions as 
fixed compartments into which the movable elements can be 
laid, or, conversely, we may think of the elements as fixed and 
shift the movable numbers about. With each permutation is 
associated a one-to-one correspondence p-> p’ which tells 
which element p’ occupies, after the exchange, the position 
previously held by p. Insofar as the method of numbering is 
considered as left to convention, the permutation is nothing 
more than this one-to-one correspondence. The concept is to 
be understood in this way when we are concerned with the 
composition or successive application of permutations. 

2. A kinematical example of a group is offered by the motions 
of a space-filling substance, in particular those of a rigid body. 
The positions or numbers of the preceding example are here 
represented by the material points and the point-field 1s the 
space itself. The one-to-one correspondence p —> p’ connects 
the initial with the final state: that material point which origin- 
ally covered the spatial point p is taken to the point p’ by the 
motion. Congruent correspondences of space on to itself will 
also be briefly referred to as ‘‘ motions” in the geometrical 
sense. 

The concept of a group of transformations is now readily 
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formulated. We understand by it any system Œ of transforma- 
tions of a given point-field, which is closed in the sense of the 
following conditions : 

1. It contains the identity ; 

2. If S belongs to Œ, then its inverse S~! does also ; 

3. The resultant TS of any two transformations S, T of Œ 
is also a transformation of ©. 

As examples we name the group of all n! permutations of n 
things, the congruent mappings or ‘ motions ” of 3-dimensional 
Euclidean space, all homogeneous linear transformations in 
n variables with non-vanishing determinants (affine correspond- 
ence of an n-dimensional vector space) and the group of unitary 
transformations in n dimensions. 

If the point p goes over into p’ by means of a transformation 
of the group Œ, then p’ is said to be equivalent to p (with respect 
to the group ©). The same concept is applied when we are 
considering instead of a point p a figure consisting of points. 
Expressed in these terms, the three requirements for a group 
are nothing other than the three axioms of equality: 

1. p is equivalent to p; 

2. If p’ is equivalent to p, then p is equivalent to p’; 

3. If p’ is equivalent to p and p” to p’, then p” is equivalent 
to p. 

According to Klein's Erlanger Program ? any geometry of 
a point-field is based on a particular transformation group © 
of the field; figures which are equivalent with respect to , 
and which can therefore be carried into one another by a trans- 
formation of Œ, are to be considered as the same. In Euclidean 
geometry this rôle is played by the group of congruency trans- 
formations, consisting of the motions referred to above, and 
in affine geometry by the group of affine transformations, etc. 
The group expresses the specific isotropy or homogeneity of the 
space ; it consists of all one-to-one ‘‘ isomorphic correspondences ” 
of the space on itself, i.e. those transformations which leave 
undisturbed all objective relations between points of the space 
which can be expressed geometrically. The symmetry of a 
particular figure in such a space is described by a sub-group of 
® consisting of all transformations of @ which carry the figure 
over into itself. The art of ornamental tiling, which was per- 
fected by the Egyptians, contains implicitly considerable know- 
ledge of a group-theoretic nature; we here find, perhaps, the 
oldest fragment of mathematics in human culture. But only 
recently have we been able to formulate clearly the formal 
principles of this art; attempts in this direction were already 
made by Leonardo da Vinci, who sought to give a general and 
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systematic account of the various types of symmetry possible 
in a building. But the most wonderful symmetrical structures 
are exhibited in crystals, the symmetry of which is described 
by those congruency transformations of Euclidean space which 
bring the atomic lattices of the crystal into coincidence with 
themselves. The most important application of group theory 
to natural science heretofore has been in this field. 

The following considerations fit naturally into the present 
discussion. Let the point-field M on which the transformations 
S of the group © operate, be mapped on the point-field N by 
means of the one-to-one correspondence A: p->q; the case 
in which the correspondence serves to introduce new numbering 
or new co-ordinates is of particular importance. Through this 
correspondence A of M on N the transformation S of M becomes 
a transformation T of N ; in the particular case mentioned above 
T is simply a description of the transformation S in the new 
co-ordinates. It is evident that to the composition of trans- 
formations S corresponds the composition of the corresponding 
transformations T of N and that a group © of transformations S 
goes over into a group of transformations T. The relation 
between these two transformations is 


T = ASA} (1.2) 


for if we denote the transformation S by p —> p’ and if q, q' are 
the points of N associated with p, p' by A, then the transforma- 
tion q —> q' of N is effected by 


q>p>po]. 


We may also write § = AGA. In particular, these considera- 
tions apply when N and M are the same point-field. 


§ 2. Abstract Groups and their Realization 


An arbitrary number of transformations of a given point-field 
on to itself can be applied successively; we are of course not 
restricted to merely two. But when we perform this process 
step by step it is automatically reduced to a succession of com- 
positions of transformations taken two at a time: 


ABC +++ = A[B(C ++ *)]. 


This possibility of performing an extended composition in steps 
involving but two transformations at a time shows that the 
associative law 


(AB)C = A(BC) 
holds for any three transformations A, B, C. 
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The structure of a transformation group is obtained from it 
by abstraction when we allow the transformations themselves 
to degenerate into elements of an immaterial nature, retainin & 
only their individuality and the rules in accordance with which 
two given transformations are composed, in a given order, tO 
form a third. In accordance with what has been said such 
composition necessarily obeys the associative law. Perhaps it 
also obeys other universal laws, but since we have at present 
no indication of this we attempt a formulation of the abstract 
structure of the group by means of the following definitions : 

An abstract group is a system of elements within which a laze 
of composition is given such that by means of it there arises from 
any two (the same or different) elements a, b of the group, taken +74 
this order, an element ba. The following conditions shall thereby 
be satisfied : 

1l. The associative law c(ba) = (cb)a ; 

2. There shall exist an element 1, the unit element, which leaves 
an arbitrary element a unaltered on composition with it : 


la = al = a. 


3. To each element a shall exist an inverse a~' which yields 07t 

composition with it the unit element I: 
aam = aa = I. 

Such an abstract group is not to be confused with its realt- 
zation by transformations, i.e. by one-to-one correspondences of 
a given point-field. A realization consists in associating wits 
each element a of the abstract group a transformation T(a) of the 
point-field in such a way that to the composition of elements of 
the group corresponds composition of the associated transforma- 
tions : 

T (ba) = T(b)T (a). (2.1) 
It follows from this that to the unit element I corresponds the 
identity J and to inverse elements a, a`? correspond inverse 
transformations : 

T(a™) = THa). (2.2) 
The first assertion follows from the particular case 

T(a)T (1) = T(a) 
of (2.1) by left-handed composition with the reciprocal of the 
transformation T(a); (2.2) is then contained in (2.1) as the 
particular case b = a7. The realization is said to be fafthfurd 
when to distinct elements of the group correspond distinct 
transformations : 


I(a) + T(b) when a + b. 
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In accordance with the fundamental equation (2.1) the necessary 
and sufficient condition for “faithfulness” is that T(a) shall be 
the identity only if a is the unit element. For if a, b are two 
elements of the group it then follows from T(a) = T(b), i.e. 


T(a)T-*(b) = T(a)T(b“!) = T(ab“) = I 


that under these conditions ab-} = J, i.e. a= b. If the abstract 
group is obtained from a transformation group © by abstraction, 
then conversely G is a faithful realization of it. 

In the study of transformation groups we always deal with 
two manifolds, the structureless point-field and the manifold of 
group elements, the structure of which is expressed by the law 
of composition. The original problem thus resolves itself into 
two; the examination of the various group structures possible 
and the examination of the possibility of obtaining realizations 
of the given abstract group by transformations of a given point- 
field. The historical development of the subject has shown that 
it is advantageous to effect this division into two problems; 
they are of fundamentally different character and require 
fundamentally different mathematical equipment for their 
discussion. 

In accordance with our method of introducing the abstract 
group, which we henceforth refer to simply as the group, it 
serves merely to give the structure of the group; the nature of 
its elements is immaterial. This abstraction from the nature 
of the elements is expressed mathematically by the concept of 
isomorphism. If we have two groups g, g’ and there is as- 
sociated with each element a of g an element a’ of g’ in a one- 
to-one way: a È a’, such that 


(ba) = b'a', (2.3) 


then the two groups are said to be simply isomorphic. Simply 
isomorphic abstract groups offer no means of distinguishing one 
from the other. The concept of isomorphism can, of course, be 
applied to transformation groups. Two isomorphic transforma- 
tion groups can be considered as faithful representations of 
one ànd the same abstract group. A group may be isomorphic 
with itself; it is then said to be automorphic. Such an auto- 
morphism occurs when g and g’ coincide, i.e. when a one-to-one 
reciprocal association a@a’ satisfying the condition {2.3) is 
established between the elements of the group g. 

The question arises whether or not every abstract group 
possesses a faithful realization. If this were not the case the 
concept of an abstract group as developed above would be too 
broad—there would exist, in addition to the associative law, 
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other purely formal laws for the composition of transformations 
which are satisfied by every transformation group. Conversely, 
a proof of the realizability of any abstract group would tell us 
that all that can be said about the formal laws for the com- 
position of transformations is contained in our conditions (1) 
to (3). We can, in fact, construct a faithful realization of any 
abstract group g by taking as the point-field the group manifold 
itself and letting correspond to each element a of the group 
the transformation 
S —> S =as 


of the group manifold on to itself. This “ left-translation ” 
ta is obviously a one-to-one reciprocal transformation which 
has as inverse the transformation s=a™ 1s’. If a and b are 
distinct elements the corresponding transformations ta, t are 
distinct, for they allow the unit element I to correspond to the 
distinct elements a, b respectively. If we perform in succession 
two left-translations 


s>s'=as, S —> s" = bs 


the resulting transformation is, in consequence of the associative 
law, 
s — s” = b(as) = (bajs. 


Consequently the left-translations constitute in fact a faithful 
realization of the abstract group. However, the right-trans- 
lations behave otherwise, for if we denote the mapping 
s—> s' = sa of the group manifold on itself by t*(a), we find 
instead of (2.1) the equation 


t*(ba) = t*(a)t* (b). 


§ 3. Sub-groups and Conjugate Classes 


A sub-group g' of a given abstract group gis a set of elements 
contained in g which itself fulfils the characteristic group con- 
ditions: the unit element | belongs to g’, with a belongs also 
a` and with a, b also ba. These three conditions can be reduced 
to the one: if a, b are any two elements of gq’, then ba} also 
belongs to g’. We assume, of course, that the partial system 
consists not merely of the element l, but the other limiting 
case, in which g’ coincides with g, shall be included under the 
concept of a sub-group. 

Examples are readily found. In the group of Euclidean 
motions are contained, for example, the group of rotations 
(which leaves one point, the centre, fixed) and the group of 
translations. The unitary transformations constitute a sub- 
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group of the complete group of all homogeneous linear transforma- 
tions; the even permutations a sub-group of the group of all 
permutations. If we are dealing with a transformation group 
®©, all those transformations of @ which leave a particular 
point p fixed (i.e. which carry p over into itself) constitute a 
sub-group @,. Instead of a point p the fixed element may be 
any figure composed of points; the transformations of the sub- 
group must either leave the figure as a whole fixed (i.e. they must 
carry each point of the figure over into another such) or the 
more restrictive condition that they leave each point of the 
figure fixed. We can also obtain sub-groups of & by employing 
invariant functions instead of invariant figures. If #(p) is any 
function of position on the point-field with elements p we as- 
sociate with the transformation S:p— >’ the function yw’ 
defined by '(p’) = ẹ(p) and say that it is obtained from # by 
the transformation S. If p’ = Sp, p” = Tp’, the equations 
pe) = p'e) = pp”) 
show that the composition of the transitions %—> W% and 
us’ — ob” associated with S and T result in the transition y > y” 
associated with TS. Now consider all transformations S of @ 
which carry ¢&(p) over into itself, i.e. for which 4(Sp) = (p) is 
an identity in p; they constitute a sub-group © of G, and 
pp) is an invariant of H. In this way we can separate out 
the rotations from the homogeneous linear transformations by 
requiring the invariance of the unit quadratic form. The sub- 
groups contained in a finite group g, which is described by 
exhibiting each of its elements and giving explicitly the result 
of composition of each two, can be obtained by inspection. 
There is associated with each element a of the group g a 
cyclic sub-group denoted by (a): 


É a? a, a= L a, ats (3.1) 
the elements a” of which are defined inductively by the equations 
a? = [, a = ara, 

These elements constitute in fact a group, for n and m being 
any integral exponents we have 

qutm = a'a. 
(a) is the smallest sub-group which contains a, i.e. its elements 
are common to all sub-groups of g which contain a. The 
elements of the set (3.1) can either be distinct or—and this 
latter must be the case if g is a finite group—they must repeat 


themselves after a cycle of h terms: l, a, a, +--+, al are 
distinct but a? = |. his called the order of the element a. 
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The order of a finite group is the number of its elements ; 
accordingly, the order of an element a agrees with the order of 
the cyclic sub-group (a) generated by a. A group is said to be 
commutative or Abelian if composition of its elements obeys 
the rule ba = ab. Cyclic groups are therefore Abelian. 

If a runs through the sub-group Ñ of g the associated (left-) 
translations t, constitute a group of transformations which is 
simply isomorphic with H, the point-field of which is the group 
manifold. We say that two elements s, s’ which are equivalent 
with respect to this transformation group are (left-)equivalent 
with respect to ) and express this situation by the notation 
“s'=s with respect to }’’; the condition for it is that s’ = as 
where a is an element of §. In this way the elements of g are 
divided into sets of elements which are equivalent to }. If 
the number of such sets is finite, it is called the index of h in g. 
If g is a finite group the number of elements in each of these 
sets is given by the order of §, for different translations t, send 
s into different elements: as + bs if a + b. The order of } is 
accordingly a divisor of ihe order of g, and the quotient of these two 
is the index of ¥). 

The considerations at the end of §2 above, which were 
developed for groups of transformations, suggest a second 
realization of the abstract group g. We associate with the 
element a the correspondence 


s—> Ss = asa} (3.2) 


of the group manifold on itself. This correspondence, which 
we call the ‘‘ conjugation’ fa, is reciprocal one-to-one, and has 
as inverse s = a1 s'a. The law of composition is obeyed, for 
from 

s—> s = asa, s — s” = bs'b™ 


we obtain the product 
S” = basa™t b! = (ba)s(ba)“. 


Two elements s, s’ of g are said to be conjugate if they are 
equivalent with respect to the group of all conjugations. Ac- 
cordingly, the whole group is divided into classes, any element 
of one of which is conjugate to any other element of the same 
class. When we speak of classes within a group without a 
more explicit description we mean these conjugate classes. 

The realization of g by the group of conjugations is in general 
a“ contracted ” rather than a faithful realization. In particular, 
the conjugation f, coincides with the identity if a commutes 
with all elements s of the group. The totality of all such ele- 
ments a is called the central of the group; it is obviously 
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an Abelian sub-group of g. But this disadvantage of con- 
jugation over translation is offset by an advantage ; conjugation 
is an isomorphic correspondence within the group itself which 
leaves the unit element invariant and which associates with 
each sub-group § of g another such, the conjugate sub-group 
ala“. These facts, which are expressed by the equation 


a(st\a~} = (asa™)(ata™), 


were already contained implicitly in the considerations at the 
end of § 1. § is said to be a self=conjugate or invariant 
sub-group if it coincides with all its conjugate sub-groups. 
The importance of this last concept is best seen in the 
following : 
Theorem. If his an invariant sub-group and = denotes equiva- 
lence with respect to it, then it follows from 


'=s,U=t that st =st. (3.3) 
To prove this we note that s’ = as, ť = bt (a, b in §) yield 
s't = asbt = (ac) (st). (3.4) 


for c = sbs! belongs to } with b. Since ac lies in § our assertion 
is proven. It is readily seen that the invariantive nature of } is 
necessary as well as sufficient for the validity of (3.3). In deal- 
ing with an invariant sub-group we need not distinguish 
between right and left equivalence with respect to }—1indeed, 
the above proof was based on this fact. 

We may, if we like, consider equivalent elements as not 
differing from one another (by application of the principle of 
definition by abstraction); but by thus allowing equivalent 
elements to fall together the group property of g is, in general, 
forfeited. In accordance with the above theorem it still remains, 
however, if ĵ is an invariant sub-group. The group obtained 
from g by identifying all elements which are equivalent with 
respect to ) is called the factor group g/}); its order is the 
index of the invariant sub-group } of g. | 

These concepts are of assistance in examining the way in 
which a group may be “ contracted ” on setting up a realization. 
Let the transformation T(a) of a given point-field on itself 
correspond to the element a of the abstract group g in the realiza- 
tion under consideration. Then T(a) = T(a’) if and only if a’ 
is obtained from a by composition with an element e (i.e. a’ = ea) 
for which T(e) is the identity. Such elements e obviously con- 
stitute a sub-group 9 of g, for it follows from 


T(e) =I, T(e)=I that T(ee’) = T(e)T(e’) = I. 


Se ee ie 
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ý is, in fact, an invariant sub-group, for if T(e) is the identity, 
the same is true of 
T(aea~!) = T(a)T(e)T-M(a) = T(a)T~ (a). 

In any realization of an abstract group g by a group of transforma- 
tions the elements of a certain invariant sub-group } of g correspond 
to the identical transformation ; two different elements will be 
associated with the same transformation if and only if they are 
equivalent with respect to }. The group of transformations ts 
consequently a faithful realization of the factor group 9/9. 


§ 4. Representation of Groups by Linear 
Transformations 


On requiring that the transformations which are to serve 
as a realization of a given abstract group g be linear and homo- 
geneous we arrive at a problem which is most fruitful from the 
mathematical standpoint and which is at the same time of 
greatest importance for quantum mechanics; we then speak of 
a representation, instead of a realization, of the group.? An 
n-dimensional representation of g, or a representation of degree 72, 
consists in associating with each element s of the group an 
affine transformation U(s) of the n-dimensional vector space 
R = R, in such a way that these transformations obey the 
law of composition : 


U(s)U(t) = U(st). (4.1) 


We then say that s induces the transformation U(s) in the 
representation space R. On choosing a definite co-ordinate 
system in R each transformation U(s) is represented by a square 
matrix of n rows and columns, the determinant of which does 
not vanish. On replacing the original co-ordinate system by 
another, obtained from it by the transformation A, the corre- 
spondence which was formerly represented by the matrix U(s) 
is now represented by the matrix AU(s)A-1. Consequently if 
the association s—> U(s) is a representation, the association 


s—> AU(s)A74 


is obviously also one; this latter representation is said to be 
equivalent to the former. They are essentially the same, 
differing only in the choice of the co-ordinate system in terms 
of which they are described. 

Examples—A representation in one dimension consists in 
assigning to each element s of the group a non-vanishing number 
x(s) in such a way that 


x(st) = x(s) x(2). (4.2) 


k toa o ene ao 
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In particular, (I) = 1. A most trivial l-dimensional repre- 
sentation is obtained by assigning to each s the number 1: 
x(s)= 1. This special case is called the identical representation. 

Consider next the so-called symmetric group, the group 
m = my of all f! permutations of f things. The association 


s —> ðs = +1, 


according as s is an even or an odd permutation, defines a 
1-dimensional representation, the ‘‘ alternating ’’ representation 
of the group m. For the character 6,, which distinguishes 
between the even and the odd permutations, satisfies the 
equation 

851 = 6,° by. 


Let g be a finite cyclical group of order h; the elements 
s are then 7 
I, a, a’, e.. a-t 


and a, = }. Consider the l-dimensional representation s —> x(s) 
in which y(a) = e. The condition (4.2) for a representation 
then tells us that to the elements s of this series correspond 


2 s. à ne 
J, £, £f, , hod, 


and that to a’ corresponds e*. Hence eê = 1; e must therefore 
be an A® root of unity and the law defining the representation 
isa" — E (r = 0, 1, 2, +--+). Conversely, when ¢ is an arbitrary 
h* root of unity this association defines a l-dimensional re- 
presentation of g. We have thus obtained a complete survey 
of all possible l-dimensional representations of a cyclical group. 

The only example of a multi-dimensional representation 
which we offer at this time is the following trivial one. If 


8 is itself a group of linear transformations of an n-dimensional 


vector space K, then the association s —> s defines an n-dimensional 
representation of g. This example implies more than one might 
at first sight imagine. We have in fact to do the following : 
we first obtain the structure of the group g by abstraction from 
the group of linear transformations and then return to the 
original realization by means of the correspondence s — s 
between an element s of the abstract group on the one hand 
and the linear transformation s on the other. 

The concept of equivalence has a more general significance 
than that discussed above. It may refer to an arbitrary system 
& of linear correspondences U of the n-dimensional vector 
space R. We need not assume that these correspondences 
possess an inverse (ie. that they have a non-vanishing deter- 
minant), nor need we assume that they are associated with 
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the elements s of a group, as is the case with representations. 
On expressing the set of correspondences U in terms of a new 
co-ordinate system each matrix U goes over into the matrix 
U’ = AUA; the system & is transformed into the equivalent 
system 2” consisting of the U’. A is here a fixed non-singular 
matrix. : 

Consider a correspondence U of R on to itself. A linear 
sub-space R of R is said to be invariant under U if the vectors 
of R are transformed into vectors of RW' by U. If R is invariant 
then the space R (mod. R’) obtained by projecting R with 
respect to Ñ’ is also invariant (cf. I, § 2, in particular Fig. 1). 
R being invariant, U gives rise to a correspondence U’ of W 
on to itself; we say that U induces U’ in R. Similarly for 
the space obtained by projection. We now pass from a single 
correspondence U to a system 2 of correspondences. R’ is 
said to be invariant under Æ if it is invariant under each corre- 
spondence U of X. Describing R in terms of a co-ordinate 
system which is adapted to the invariant sub-space R’, all 
matrices U of the system Æ reduce simultaneously to the form 
illustrated in Fig. 1, p. 8. 2 is called irreducible if R con- 
tains no sub-space, other than X itself and the space 0 consisting 
only of the vector 0, which is invariant under 2. We shall 
have occasion to reduce R in such a way that each constituent 
separated off is irreducible under a given system 2. This 
requires the construction of a series of sub-spaces 


0, Ry, Re, T.N R, SF R, (4.3) 


beginning with 0 and ending with R, in which each member 
is contained in the preceding one and is such that R; (mod. Riy) 
is irreducible. Naturally R, shall actually be larger than Riu 
not merely coincide with it. The implications of this reduction 
are most readily seen in terms of the matrices U of the corre- 
spondences of the system Æ on adapting the co-ordinate system 
to the “composition series ” (4.3), i.e. by choosing first a co- 
ordinate system in R,, then supplementing it with additional 
fundamental vectors in order to obtain a co-ordinate system 
for Rs, Ra, °° -in turn, 

2' is said to be completely reducible if R can be decomposed 
into two sub-spaces R + R, each of which are invariant under 2 
and such that neither of them consists merely of the vector 0. 
This concept of complete reducibility is more exacting than that 
of mere reducibility. On describing R in terms of a co-ordinate 
system which is adapted to this decomposition, each matrix 
U of Æ assumes the form illustrated in Fig. 2, p. 9. We are 
then faced with the problem-of decomposing R (or 2) into 
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constituents, none of which is completely reducible, i.e. of 
decomposing R = R, +--+ R, into invariant sub-spaces, 
none of which is completely reducible. 

We often find that reducibility implies complete reducibility, 
i.e. that in many cases we have the theorem: If R’ is an in- 
variant sub-space of R, a second invariant sub-space R” can 
be found such that R is completely reducible (with respect to 
X) into RW + R”. We shall soon see that this is actually the 
case when KR is a unitary space and Æ is a system of unitary 
transformations. 

It was shown in Chap. I, § 3, that if the system & is re- 
ducible, then the system 2* of ‘‘ transposed” correspondences 
of the dual space on itself is also reducible. If H:s- U(s) 
is an n-dimensional representation of the group g the transposed 
U*(s) do not constitute a representation; it 1s readily seen, 
however, that on employing instead the contragredient corre- 
spondences 


U(s) = [U*(s)] 


we do obtain a representation s —> U(s) of the dual vector space. 
This we call the contragredient representation $. 


§5. Formal Processes. Clebsch-Gordan Series 


Continuous groups offer what are perhaps the simplest 
examples of the theory of representations. We consider in 
particular the group c = c, of all linear and homogeneous trans- 
formations s in n variables %,, Xa °° ', Xa with non-vanishing 
determinants; we consider each set of values x; as a vector 
in,an n-dimensional vector space t = 1,. The classical theory 
of invariants, first developed in England about the middle of 
the last century, concerned itself in particular with the repre- 
sentations of c induced on the coefficients of arbitrary forms 
in the variables x, A quadratic form in these variables is a 
linear combination of the n(n + 1)/2 linearly independent 
products x;%,; under the influence of a linear transformation 
s of the x; these products undergo a linear transformation [5], 
and the correspondence s-> [s]a is obviously a representation 
[c]? in n(n + 1)/2 dimensions of the group c. The transformation 
s of the variables x; sends the arbitrary quadratic form 


Dip, Xi Hy 
into a quadratic form 


ld , rd 
Dy Uy Xi Xk 
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in the new variables, where the coefficients aj, are obtained 
from the a;, by a certain linear transformation S associated with 
S; Sais obviously contragredient to [s], The quadratic form 
characterized by a fixed set of n(n + 1)/2 coefficients a; may 
therefore be considered as a vector in a space of this number of 
dimensions, and the transformation s of the variables x; induces 
the transformation są in this space. The space thus defined b 
the totality of n-ary quadratic forms is thus the point-field for 
a group of linear homogeneous transformations which constitute 
a representation of the group €. 

We may in the same way deal with cubic, quartic, -° ;, 
f-ic forms. The totality of monomials of order f are contained 
in the formula 

afi age wee xJn (5.1) 


where the f; are non-negative integers whose sum 


Aith+: i "+h f 


They constitute the substratum of a representation [c]/ in 


Bee --(n+f—1) 
fi L2.. f 
dimensions. 

But we can exhibit representations of ¢ which are formally 
yet simpler than these arising from the theory of forms. Let 
(x;) and (y;) be two arbitrary vectors in our n-dimensional 
space t and consider the products x;y} On subjecting the x, 
and the y; to the same transformation s of ¢ (transition to a 
new co-ordinate system) the n? products undergo a certain 
linear transformation s X s associated with s and the corre- 
spondence s -> s X s is an n?-dimensional representation (c)* of c. 
Now a system of numbers F(i, k), depending on two indices i, k 
which run through the values 1, 2,- : -, n, is said to be a tensor 
of second order if under the influence of a transformation s of 
t the F(z, k) undergo the same transformation as the products 
Xi Yx of the components of two arbitrary vectors g, 9 of r. Hence 
the tensors of order 2 are the substratum of the representation 
(c)? of e. (e)? contains the representation [c]? which is induced 
in the sub-space of symmetric tensors of order 2; the tensor 
with components F(t, k) being symmetric if F(ik) = F(ki). 

In geometry the anti-symmetric tensors, i.e. tensors whose 
components satisfy the condition F(zk) = —- F(ki), play a more 
important rôle than the symmetric ones.” In particular, two 
arbitrary vectors (x,), (y,) define a surface element with 
components 


x {tk} = ie — HRs ; 
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of these quantities but n(n — 1)/2 are linearly independent, 
say those for which 1< k. On subjecting the components x; 
of the vector z and the components y; of the vector ) to the 
same linear transformation s, the components of the surface 
element defined by them undergo an n(n — 1)/2-dimensional 
linear transformation {s}. s— {st}, is a representation {c}? 
whose substratum is the totality of anti-symmetric tensors of 
order 2. Hence the representation (c)? is reduced into the 
representations [c]? and {c}, for any tensor F(zk) can obviously 
be written | 


Flik) = 5[F (iB) + Flki)] + S{FG) — F(ki)], 


i.e. in a unique manner as the sum of its symmetric and anti- 
symmetric parts. That this reduction is correct is further borne 
out by the fact that the dimensionalities satisfy 


a Mn+ 1), nin—1)) 
n“ = 5 + 5 . 
Similarly three arbitrary vectors z, 9, 3 determine a 3-dimen- 
sional element of volume with components 


Xi Xk Xı 
xfikd = | yi Yr Yıl. (5.2) 
Zi Bk l 


These elements constitute the substratum of a representation 


y = n n(n — 1)(n — 2) 
Gh as 


dimensions. Continuing in this way we can construct 4.-, 
5-, - - + n-dimensional elements; this process must cease with 
n-rowed determinants, for a determinant of the form (5.2) with - 
more than n rows must necessarily vanish identically. 

We shall see that the representations of c whose substrata 
are the symmetric and anti-symmetric tensors of order f are 
irreducible, and shall in fact solve the general problem of effect- 
ing the complete reductions of (c)/, the representation induced 
by c in the space of all tensors of order f, into its irreducible 
constituents (Chap. V). 

The tensor concept really depends on the X -multiplication 
introduced in II, § 10. If the m variables x; undergo a trans- 
formation A and the n variables yẹ a transformation B, then 
the mn products x, undergo a transformation A X B. Con- 
sidering the x; as the components of an arbitrary vector g in 
an m-dimensional space Rm and the y, as the components of 
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y in Rn, the products x; * Yy may be considered as the components 
of a vector r X Ņ in an mn-dimensional vector space iim X R. 
Hence two representations 


H:s—> U(s), H :s—U'(s) (5.3) 


of g in m, n-dimensions, respectively, give rise to a new mn- 
dimensional representation which we denote by © x Q’: 

H X H: s—> U(s) x U'(s). (5.4) 
This presents a general method of obtaining a new representa- 
tion © x §’ from two given representations Q, 0’. 

Denoting the representation s—> $ of the linear group c for 
the moment by (c), the representations of ¢ whose substrata 
are the tensors of order 2, 3, - ++ are then (c) X (c) = (c)?, 
(c) x (0 x (6) = (©) % °° 

We should, perhaps, have discussed the addition + of two 
representations before discussing their multiplication X. Con- 
sider the variables x; and y, as the components of a single vector 
4 in an (m + n)-dimensional vector space; when the x; are 
subjected to the transformation A and the y, to the trans- 
formation B these m + n variables undergo a certain trans- 
formation (A, B). Hence we obtain from (5.3) the representation 

Ñ x O': s— [U(s), U"(s)] 
in m + n dimensions. The inverse of this process is complete 
reduction, as discussed above: © + Q’ is completely reducible 
into the components § and 9’. 

Another important formal method is the following: Any 
representation I’ in N-dimensions of the linear group c, in 
n-dimensions may be used to construct an N-dimensional 
representation of any abstract group g from an n-dimensional 
representation © of the same. JI” associates with the linear 
transformation u in n-dimensional space a linear transformation 
U in N dimensions, so if Q: s5s-—> u is an n-dimensional repre- 
sentation of the group g with elements s, then 


S> u> U 


is an N-dimensional representation s —> U of g which we may 
denote by I'(Q). To this is due the importance of the repre- 
sentations of the linear group for the general theory of repre- 
sentations. For example, take I to be the representation of 
¢ whose substratum is the dual space, the space of all tensors 
of order 2, of the symmetric or anti-symmetric tensors of order 2, 
etc.; we then obtain from the representation § of the abstract 


group g the representation ý, O x , [H x 9], {9 x H}, etc. 


came eS teas sce 
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The three most important formal processes are (1) addition 
(2) x-multiplication, and (3) the I" process. The first two 
generate a new representation from one or two given repre- 
sentations, the third a new one from a given representation. 
The first two are completely circumscribed, but the third 
contains a general method, for I” may be any representation of 
the linear group ĉn. 

If g’ is a sub-group of g, then any representation 
© : s > U(s) of g contains a representation of 9’; we need only 
let the element s run through the sub-group g’! This too may 
be considered as a formal process (4) which generates a repre- 
sentation of g’ from a given representation of g. 

The X-multiplication occurs in yet another connection. 
Given two groups g, g’, we can consider the pairs (s, s’), the 
first member s of which is an element of g and the second s’ 
an element of g’, as the elements of a new group g X g', the 
direct product of g and q’, obeying the multiplication law 


(s, Y(t, P) = (st, st’) 


The order of g Xx g’ is the product of the orders of g and g. If 
§:s-» U(s) is an n-dimensional representation of g and 
H : s’ > U'(s’) an n’-dimensional representation of g’, then 


(s, s) + U(s) X U'(s') (5.5) 


is obviously a representation in nn’ dimensions of the group 
g xg’; we denote it by © X ’ (with a boldface x). This 
construction may be broken up into two steps. First introduce 
the representation 


(s, s’) > U(s) 


of g X g'; there is no reason why we should not designate it 
by the same letter as the representation s > U(s) of g—we are 
accustomed to calling the function f(x), considered as a function 
of the two variables x, y, by the same letter as the function 
f(x) of the single variable x. U(s) and U“(s’) are thus to be 
considered as functions of the same variable pair (s, s’), and then 
the representation © X H’ of g x g' may be obtained by ordinary 
x -multiplication from § and §’. The differentiation between 
boldface x and ordinary X is accordingly purely pedantic. 


Examples. Unimodular Group in Two Dimensions 


Let g = ¢ = ¢, consist of all linear transformations s of two 
variables x, y : 


x! = ax + by, y = cx + dy, (5.6) 
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whose determinant ad — be = 1 (‘‘ unimodular” linear trans- 
formations *). A homogeneous polynomial in x, y of order f is 
a linear combination of the f + 1 monomials 
° ad, xl y, > À "x yit, yf. (5.7) 

Under the influence of s they undergo a linear transformation 
which we denoted above by [s];; they constitute the substratum 
of a representation [c]/: s — [s] in f+ 1 dimensions which we 
now denote by Gy Œ; is, although we have yet to prove it, 
irreducible. ' 

We can restrict ourselves within ¢ to the sub-group c, of 
" principal” transformations which transform each of the 
variables separately : 


max, y= ly, (5.8) 


where a +0 is an arbitrary constant. ¢, is Abelian. This 
transformation multiplies the monomials of the set (5.7) by 


af, af-2 a œ - a~(f-2), as. 


On associating the number a" with the element (5.8) of cĉ, we 
obtain a 1-dimensional representation which we denote for the 
moment by C; here r can be any fixed integral exponent. 
We have just seen that the irreducible representation ©, of ts 
is completely reduced on restricting ourselves to the sub-group 
¢, into f+ 1 one-dimensional representations C0) with r= f, 
f—2,:+:+,—f. This is an example of the process ($). 

As an example of multiplication and addition we consider 
the problem of reducing the product Gy x ©, of the two repre- 
sentations @,, ©, of c into its irreducible components. The 
result is contained in the formula 


| C; x G,= 2b, | (5.9) 


where v runs through the series 
lu=fteft+e—%---, fe] (5.10) 


without repetition, decreasing by 2 from term to term. This 
equation is essentially identical with the Clebsch-Gordan series 
which plays such an important réle in the theory of invariants 
of binary forms. We shall see in the succeeding chapters that 
it may justly be considered as the fundamental mathematical 


* c„ will usually denote the group of all non-singular linear transformations 
in #-dimensions ; it will however occasionally be used to denote the more 
restricted unimodular group, in which case the restriction will be explicitly 
stated. 
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formula for the classification of atomic spectra and for the theory 
of the valence bond. 
The proof consists in showing that 


Oy x Gy = Cry + Opa X C-i), (5.11) 


for (5.9) then follows by mathematical induction and the fact 
that obviously 
OF x O = Cy. 


A new co-ordinate system for the representation space of Gy 
is obtained by replacing the basis (5.7) of homogeneous poly- 
nomials of order f by another basis. In this sense we can say 
that the polynomials of order f constitute the substratum of 
the representation ©; The substratum of the representation 
©, x ©, is then the totality of polynomials 


P= Dewy; En) 


depending on the components of two arbitrary vectors (xy), 
(En), homogeneous and of order f in the first, and homogeneous 
and of order g in the second ; we write the total order f + g = h. 
The ® are thus linear combinations of the (f+ l1)(g+ 1) 
monomials 


yke ne where t+k=fiitn=g. (5.12) 


Both vectors are transformed cogrediently under the same trans- 
formation s, (5.6). The problem consists in completely reducing 
the space of the polynomials ® into two sub-spaces (Ð), and 
(®)’ which are the substrata of the representations ©, and 
Gr, X ©, respectively. We first discuss the structure of 
these two sub-spaces. 

(®),. Expand 


(ast + By) ak + Br)? = ah «by + (T) AB- dh to + Bg 
(5.13) 


in powers of the undetermined coefficients «, B. The 
hı = $,(xy; én) are special polynomials of the type ® and span 
the sub-space (Ø), We must now show that this sub-space is 
invariant under the transformation (5.6) of the variables ; 
i.e. that 6; = ¢,(x’y’; En) is a linear combination of the 
$: = xy; én). It is clear that if this is the case then ¢ in- 
duces the representation ©, in (Ø), for on identifying the two 
vectors 
E=x, n=y (5.14) 
d: becomes 
giley ; xy) = whe yi. 
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Hence we are certain a priori that the h + 1 functions ¢; are 
linearly independent. l i , 
In order to arrive at the desired proof we replace x, y in 
(5.13) by 
x! = ax + by, y'= cx + ay, 
and in the same way &, 7 by 
E = af + bn, n’ = cé + dn. 
Now note that ax’ + fy’ is the linear form 
(aa + Box + (ab + Bd)y = Ax + By 
in x and y; hence 
(auc! + By’) lat + Br’)? = (Ax + By)f(AE + Brys, 
and by (5.13) 
h —1 Qi pe h h-i Ri e ch. 
F (i) AB K= g (PAB g 
On replacing A, B on the right-hand side of this equation by 
A= «a+ Be, B= ab + Ba, 
and equating coefficients of «iB, we obtain ¢; as a linear 
combination of the ġ,. . 
(®)". The substratum of the representation ©; x © 
consists of the polynomials | 
P = Phy; gn) 
of order f — 1 in (x, y) and of order g — lin (£, 7). They are not 


polynomials of type ®; in order to increase the order in the 
components of each vector by 1] we replace each such ¥ by 


P = (xq — yé) Y. 
The factor thus introduced in no way affects the representation. 
The last step in the proof consists in showing that the total 


space of polynomials ® is completely reducible into these two 


sub-spaces; i.e. in showing that any polynomial ® can be 
written in the form 


D = (apo + ah, +: + > + ands) + (xn — yé) Y (5.15) 
with unique constant coefficients a; (The development in 
terms of powers of the determinant xn — yë obtained from this 


by induction is the Clebsch-Gordan series.) First, the dimen- 
sionalities are correct, for 


F+ DEt = (fg + 1) +4. 
Hence it suffices to show that the various terms in (5.15) are 
linearly independent, i.e. that an expression of the form (5.15), 
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in which ¥ is a polynomial of order f — 1 in (x, y) and of order 
g — 1 in (é, n), can vanish only if W vanishes identically and if 
all the coefficients a; are zero. The proof is extremely simple. 
We first let (én) = (xy) as in (5.14), then the equation ® = 0 
becomes 


dox? + atty + - . -+ ant = 0 


identically in x and y; hence a; = 0. Having established this 
we return to the two sets of variables xy; & and obtain the 


equation 


from which it follows that ¥ = 0—in an algebraic identity 
for polynomials we may always remove a factor, such as 
xn — yé, which does not vanish identically. 

Our formula (5.9) also holds for the group ¢ of all linear 
transformations of x, y with non-vanishing determinant. We 
must then interpret ©,, v =  — 21 in (5.9) as that representation 
whose substratum is the totality of homogeneous polynomials 
of order v in x and y multiplied by (xy — yé). In other words, 
the new ©, differs from the old in that the transformation of 
the (v -+ 1)-dimensional representation space corresponding to 
s in the representation ©, is to be multiplied by the J“ power 
of the determinant ad — be. 

Œ; X ©, is a representation of Cy X Cy, the group consisting 
of pairs (s, s’) whose members s and s’ run independently through 
the entire group t} On introducing the restriction that s’ is 
the element S$ obtained from s by replacing the coefficients of 


the linear transformation s by their conjugate complex, ©; x Cy 


becomes a representation ©; ọ of Ca the substratum of which 
may be taken as the monomials 


xy BE (i+k=f, w+ K=g) 


of order f in (x, y) and order g in (%, ¥). It can be shown that 
Œ; , is also irreducible. 


§ 6. The Jordan-Holder Theorem and its Analogues 


Perhaps the most fundamental theorem of mathematics 1s 
that on which the concept of cardinal numbers depends. Let 
the members of a finite set of objects distinguished by marks 
a,b, c++ - be exhibited individually in this order and associated 
with the symbols 1, 2, +- n. The theorem then states that 
the “number” n is independent of the order in which the 
objects are exhibited. The proof of this theorem is of con- 
siderable mathematical interest and offers the simplest example 
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of the type of proof employed in establishing the Jordan- 
Hélder theorem. A new enumeration consists in associating 
the symbol 1 with any one of the objects, the symbol 2 with 
any one of the remaining objects, etc., until the entire set is 
exhausted, the last object receiving the symbol n’. We assert 
that n’ = n. 

The proof is divided into two steps. (1) If in the new enumer- 
ation the symbol 1 is associated with the same object a as in 
the old, our theorem for the series from 1 to is reduced to that 
for the series from 1 to n — 1. This is immediately evident on 
discarding the object æ and reducing by one the symbols as- 
‘sociated with the objects b, c, + + - in the new as well as in the 
old enumeration. (2) If, on the other hand, the symbol 1 is 
associated with one of the other objects b, c, + - + then in the 
new enumeration the object a is associated with some symbol 
t contained in the series 2, 3, > ++, n. We now introduce a 
third enumeration which enables us to make the transition 
between the first and the second by interchanging the symbols 
1 and t in the second enumeration. The number 7’ is obviously 
unaltered by this process. But we have now introduced an 
equivalent enumeration in which the object æ is associated with 
the same symbol l as in the original and have reduced the 
general case to the one considered in (1) above. The proof of 
the theorem then follows immediately by the method of 
mathematical induction. 

As an auxiliary result of these fundamental considerations 
we have the theorem that any permutation can be obtained by 
the successive application of transpositions. 

The ¥ordan-Hélder theorem is concerned with an abstract 
group g. An invariant sub-group g’ of g which does not coincide 
with g itself is said to be maximal if there exists no invariant 
sub-group of g—except g’ and g—containing g’. The factor 
group g/g’ is then simple, i.e. it contains no invariant sub-group 
with the exception of itself and that consisting only of the 
unit element I. As was recognized by Galois, the so-called 
composition series 


Bo = 9, Gi Gar” © *) Geos Ge = I (6.1) 
is of fundamental importance for the solution of algebraic 
equations. This series begins with g and ends with I, and each 
member is a maximal invariant sub-group of the preceding 
member. We assume that the composition series terminates ; 
this is naturally the case for finite groups, as the order necessarily 
decreases from term to term. The successive factor groups 


8/81 81/82: © © *, Br-1/8r = Gra (6.2) 
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are simple. The Jordan-Hélder theorem asserts that the 
structure of these factor groups, except for the order tn which they 
appear, 1s uniquely determined by g. 

Consider, therefore, a second composition series 


Bo = S, Gy Sa 
of the same group g; it is to be compared with the “ standard 
series’ (6.1). The proof of the fact that this new series also 
contains exactly r -+ 1 terms and that the corresponding factor 
groups are, except for the order in which they occur, isomorphic 
with the factor groups (6.2) is again accomplished in two steps. 

(1) If the two second members g'i, gı coincide, the theorem 
for the group g, whose standard series contains r + 1 members, 
is reduced to the corresponding theorem for the group g,, whose 
standard series contains but y members. 

(2) If gı and g^ do not coincide we construct the inter- 
section Ñ of g, and q’y, i.e. the set consisting of all elements 
common to the two. } is then an invariant sub-group of g'i 
and, as we shall prove, q’,/h is isomorphic with g/ga. That 
two elements s, ¢ of g are equivalent with respect to g,, i.e. that 
they belong to the same “ set,” is expressed by the equation 
t = as where a, isin gı If sand tareat the same time elements 
of the sub-group g’, then æ, is also in g’, and consequently it 
is an element of 8. We may therefore consider as the elements 
of g’,/h those sets in g which contain an element of g'i The 
elements contained in these classes then constitute an invariant 
sub-group © of g containing both g, and g^, and q’,/h is simply 
isomorphic with §/q,. But since g’, is maximal either 6 =g 
or H =g’ The second case implies that g, is contained in g^, 
and since it is maximal it must coincide with g’,, contrary to 
assumption. Hence § coincides with g and our assertion is 
proved. The intersection § of g, and q’, depends symmetrically 
on both, whence g/g’, and g,/§ are also simply isomorphic. 

We now proceed as follows. We construct a composition 
series for }, which we denote simply by §, - + -, and compare 
the following four composition series of g : 


G, G1, Qa" °° 
g, Si; » 
8, g 1 ĵ, n 
8, Q's, 2° 7 * 
The comparison of the first and second series is reduced to case (1). 
The second and third series agree from the member on, and 
the two foregoing factor groups 


9/81, 1/4 


fl 
| ae 
Pea. 
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are, as we have seen, simply isomorphic with 

8/81, 91/9 
on interchanging their order, The comparison between the 
third and fourth series is again reduced to the case (1). The 
proof of the theorem for composition series containing 7 +1 
members is thus reduced to the proof of the corresponding 
theorem for series with but ry members, and since it obviously 
holds for r = 2 (i.e. for simple groups) the method of mathe- 
matical induction establishes its general validity. . 

The close methodological agreement between the construction 
involved in the proof of this theorem and that involved in the 
proof of the independence of the cardinal number of a set of 
the order in which the objects are enumerated is immediately 
evident. 

E. Noether * has given a generalization of the Jordan-Hélder 
theorem which is of importance for us. A correspondence 
S—> s’ = As of the group on itself is said to be automorphic if 
multiplication is invariant under it, i.e. if (st)’ = s’t/—-we here 
neither assume that different elements s generate different 
elements s’ nor that for a given element s’ there exists an element 
s such that s— s’ in virtue of the automorphism. Let J be 
a system of such automorphic correspondences of g. We now 
admit only sub-groups of g which are invariant under 2, i.e. 
sub-groups whose elements are carried over by all operations 
of the system 2' into elements of the same sub-group. We say 
that two such ‘‘ allowed ” sub-groups g, and 8z have the same 
structure if we can set up a one-to-one simple isomorphic 
correspondence between the elements of the one and the ele- 
ments of the other in such a way that every operation A of 
the system 2 sends corresponding elements of the two sub- 
groups over into corresponding elements. The Jordan-Hdlder 
theorem still holds under this modification; its proof can be 
aken over unaltered. 

The vectors of an n-dimensional vector space R constitute 
an Abelian group whose multiplication is the addition + of 
vectors, We must for the moment supplement addition by 
the operation of multiplication of a vector by an arbitrary 
number; hence the concepts and theorems applying to vector 
space are not truly specializations of the concepts and theorems 
of Abelian groups, but there exists a thorough-going analogy 
between the two. Indicating this analogy between a group (on 
the left) and vector space (on the right) by ~ we have, for ex- 
ample, sub-group ~ linear sub-space, automorphism ~ linear 
correspondence. Indeed, a linear sub-space is a system R’ of 
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vectors such that with g and )) their sum g + b and the product 
Ag by an arbitrary number A also belong to R’, and a corre- 
spondence y > g = Ax is linear if it sends ¢ + Y and Ag over into 
r+’ and Az’, respectively. Every “sub-group” is here 
invariant, as we are dealing with Abelian groups. If R is 
a sub-space of R the space R (mod. KR’) obtained by projecting 
R with respect to R’ is the exact analogue of a factor group. 
A composition series consists. of a sequence of spaces each 
member of which is a linear sub-space of the preceding one 
and has one less dimension. The last member is the space 0, 
consisting of the vector 0 alone, and the number of members in 
the series is ] greater than the dimensionality n. The Jordan- 
Holder theorem is here valid but trivial. 

On the other hand, this theorem is of considerable importance 
on going over to Noether’s generalization. Consider a system 
F of linear correspondences of the vector space KR on itself; the 
terms invariant, equivalent, reduction shall in the following refer 
to this system. Two invariant sub-spaces R, and R, are similar 
or equivalent if a one-to-one linear correspondence ¢, ZY, can 
be set up between the vectors of the one and the vectors of the 
other in such a way that any operation A of the system sends 
corresponding vectors over into corresponding vectors. On 
reading the series (4.3) established in § 4 backwards, we have 
the exact analogue of the composition series: each member of 
the series is followed by a maximal sub-space which is invariant 
under X. (The possibility of constructing the composition 
series in increasing as well as decreasing order is due to the 
fact that the addition of vectors is commutative.) Furthermore, 
we can obtain the concepts and theorems relating to a system 
E of correspondences as genuine special cases of those of group 
theory, and not merely as analogues, by supplementing the 
system Æ with all similarity transformations, i.e. by all corre- 
spondences of the form g -> g’ = Ax representing multiplication 
by an arbitrary number A. The Jordan-Hélder-Noether theorem 
now states: Given a second composition series 


0, Mi, Re, o *) R, (6.3) 
the corresponding projection spaces 
RI, R, (mod. Mi), Rs (mod. Hy), + - - 


are equivalent to the projection spaces (4.3) 


MR, Ra (mod. R), Ra (mod, Rə), . 
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of the original series, taken in a suitable order. The number 
of members is, of course, the same in both. The reader is 
advised to reconstruct the proof of this theorem by carrying 
through the proof of the Jordan-Hélder theorem step by step 
for this case. 

In particular, if the system 2 consists of the transformations 
U(s) associated with the various elements s of a group in a 
representation  : s — U(s), our result yields the 

Uniqueness theorem: The irreducible representations separated 
off from Q by successive reduction are completely determined by §, 
except for the order in which they occur, considering equivalent 
representations as the same. In particular, the complete reduction 
of Q into irreducible components is unique, always considering 
equivalent representations as the same. 


§ 7%. Unitary Representations 


For the case in which the representation space R is unitary 
and the correspondences U(s) of ® on itself, associated with 
the element s of the group under consideration, are also unitary, 
certain of the concepts introduced above are to be modified 
accordingly. Two representations 


s—> U(s), s—> U'(s) = AU(s)A™, 


are to be considered as equivalent only if A is unitary, i.e. if it 
is a transformation from one normal co-ordinate system in 
R to another such. If R’ is a sub-space of R a unitary-orthog- 
onal co-ordinate system can be set up in R’ and supplemented 
by additional fundamental vectors to form a complete unitary- 
orthogonal co-ordinate system for the entire space R: every 
sub-space of a unitary space is per se unitary. Invariance and 
reduction remain as before, but we allow only those decom- 
positions of R into two sub-spaces R, + R, in which R,, R: 
are perpendicular. For a system of unitary correspondences 
reducibility implies complete reducibility and we have the theorem : 
If R is invariant with respect to X then R may be broken up into 
R + R” in such a way that R” is also invariant under £. We 
need merely to define R” as the space defined by all vectors per- 
pendicular to R’. The theorem naturally holds for the case in 
which 2 is a system of infinitesimal unitary correspondences or, 
what amounts to the same, a system of Hermitian forms. The 
theorem developed in the preceding section proves that these 
irreducible components are uniquely determined, in the sense 
of (unitary) equivalence, to within a permutation. 
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Examples 
(1) The Unitary Group in Two Dimensions 


The group ¢ = ¢, of linear transformations in two dimensions 
contains the sub-group u = it of unitary transformations. 
Hence the representation ©, of c obtained in § 5 is also a repre- 
sentation of u. This representation is not unitary as it stands, 
but it can readily be made unitary by a slight change. The 
transformation of @, corresponding to the unitary transforma- 
tion s of the co-ordinates x, y is that induced by son the monomials 


Xa = xiyk (i+k=f) (7.1) 
of order f. For purposes of symmetry we label these co-ordinates 
with the index m=1t-—k which runs through the values 


f, f- — f. This is also desirable because on restricting 
 Esclvea to the sub-group of ‘‘ principal transformations ”’ 


X -> EX, yey 


x, is multiplied by the factor e”. We now employ, instead of 
(7.1), the variables 


_ xiyk 
~ Vilk! (7-2) 
obtained from them by multiplication with a constant. The 
representation Wy of u will then be unitary, as follows from the 
equation 


l _ _ gink e piik 
Fi (xa +. yayi == yi ee = DX nXn- 


We call ©; even or odd according as f is even or odd. The even 
representations associate the identity 1 with the reflection 
x = — x, y = — y, 

and the odd associate with it the transformation — 1. Oy is 
also irreducible when considered as a representation of u, and 
on letting f assume the values 0, 1, 2, - they form a complete 
system of inequivalent irreducible representations of u. The proof 
of these assertions, which we employ heuristically in the follow- 
ing, will be given in Chapter V. On writing a homogeneous 
polynomial of order f in the variables x, y in the form 


È ünn 
the coefficients a, transform under the influence of a unitary 


transformation s like the components of a vector in the repre- 
sentation space of Gy. 
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The complete reduction 
(Gy X Oy) = pig + Oya X Gps) 
was accomplished by breaking up the space of the ‘‘ polynomials 
@” into two invariant sub-spaces (®), and (®)’. We must 
now verify that these two sub-spaces are mutually orthogonal 
in the unitary sense. A general polynomial ® may be written 


Sawing (PIPIRI TJ) 


where the x, are given by (7.2) and the é, are the corresponding 
monomials 


= Tal (cte=g,1—K=»). 
Two such polynomials ® with coefficients a,,, bn» are orthogonal 


if 
>» anv On = 0. 


The polynomial xf, whose highest coefficients as, = 1 while 
all others vanish, is to within a constant factor xf + & and is 
obviously perpendicular to all polynomials (®)’, for in all these 
latter the coefficient of xf? vanishes. But under the unitary 
transformation 


six’ = ax + By, y'= — Bx + ay, (7.3) 
where ax + BB = 1, xfé9 goes into 
(ax + By) (a£ + Bn)? (7.4) 


Since (®)’ and the orthogonality of polynomials are both in- 
variant under the unitary transformation s, (7.4) is also orthog- 
onal to (®)’ and, with the help of the definition (5.12) of (®)o, 
it follows from this that all polynomials of (®), are unitary- 
orthogonal to those of (®)’. 

(7.3) is the most general unimodular unitary transformation. 
This is derived in the same way as the familiar formula for the 
orthogonal transformations of two variables with unit deter- 
minant in plane analytical geometry. On writing the coefficients 


a=k +} i B=—p+wy (7.5) 


in terms of their real and imaginary parts we see that each such 
transformation is characterized by four real parameters x, A, p, v, 
the sum of whose squares is 1. The composition of two trans- 
formations s: (x, A, p, v) is accomplished in terms of these 
parameters by Hamilton's quaternion multiplication ; this latter 
led to the vector calculus. 
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(2) Unitary Groups in n-Dimensions 
The totality of tensors of order f is the substratum of an 
nJ-dimensional unitary representation (u)f of the group u = Up, 
for on denoting the components of an arbitrary tensor by 
F(iyt, + + + if the sum 
2 [Flin + + + if? (7.6) 
lin" ip) 
is a unitary invariant. On restricting ourselves to the 
{ 7 }-dimensional linear manifold of anti-symmetric tensors we 


take as the variables in tensor space those components 
F(tyig ++ + ty) for which 7, <i, <+++ <i; The sum (7.6) 
for these components only is, however, equal to the complete 
sum (7.6) divided by f!; hence the representation {u¥ of u, 
whose substratum consists of all anti-symmetric tensors, is 
unitary. The situation is somewhat different for symmetric 
tensors. The most general symmetric tensor of order f trans- 
forms like y X gX- -xg (f terms), ie. we may for the 
present purpose set 


PY(tytg © © e if = HH Kips (7.7) 
We write the monomial on the right in the form 
alt aft + + + xin (5.1) 


as before; f, is the number of times the index r appears in the 
Series 21, 12, © * *, 77. In this sense we write the components of 
a symmetrical tensor 


F (iris on ip) = (fi, fo, ne) Tn): 


The sum (7.6) becomes in this case 


À é . 
E ere lelo fa Sl 
extended over all integral f, 2 0 for which h +f,+--:-+fi=f. 


The coefficient indicates how often the term |F(iis - - + ipl? 
occurs in the sum in consequence of the fact that its value is 
unchanged on permuting the indices. We must therefore 


consider the quantities 


Pl fu fo R, fn) 
VAI fa! ->e fa! 


as independent components of an arbitrary symmetric tensor 
of order f in order to obtain a unitary representation [ulJ. 
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The truth of this assertion follows from the fact that the special 
tensor (7.7) satisfies the equation 


yh. ° - gin. Bhi. . Zin 


sats H + + aba) = E ETT (18) 


We have already seen in I, §5 that a normal co-ordinate 
system can be so chosen that a commutative system X of 
unitary correspondences is completely reduced to a set of 
1-dimensional systems. The only irreducible unitary repre- 
sentations of an Abelian group are accordingly 1-dimensional. 


For it follows from 
U(s)U() = U(st) (4.1) 


and the Abelian character of the group that the unitary matrices 
U(s) associated with the elements s are commutative. 

If © and §’ are unitary representations, then § + 9’, 
© xX ©’ are also.—The first fundamental problem for a given 
group g is to find a complete system of inequivalent irreducible 
unitary representations of g, for then any unitary representa- 
tion of g can be obtained by the addition of these irreducible 
representations. The second fundamental problem is to reduce 
the product H X Q of two irreducible representations H, 9’ of 9 
into its irreducible components ; or better (after having solved the 
first problem), to determine how often each of the irreducible 
representations occurs in this product. 

We illustrate these problems on the example offered by 
rotation groups, which are of particular importance in quantum 
physics. 


§ 8. Rotation and Lorentz Groups 


(a) The Group of Rotations in the Plane 


We describe the 2-dimensional plane by a complex co- 
ordinate x. The rotations of the plane are then given by 
x —> x! = Ex, (8.1) 
where ¢ = e* is a constant with unit modulus. (The rotations 
of the real 2-dimensional plane thus coincide with the unitary 
transformations of a single complex variable.) The angle of 
rotation ¢ determines the rotation completely, but it is of course 
only determined mod. 27 by the rotation. The angle of rotation 
behaves additively on composition: the rotation ¢ followed by 
the rotation ¢’ results in the rotation 6+ ¢’. This rotation 
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group is accordingly a one-parameter continuous Abelian group. 
We obtain a l-dimensional representation D) of our rotation 
group b = b, by associating with the element e, (8.1), the linear 
correspondence 


x> x = eM x = elm x, (8.2) 


where m is any fixed integer. I assert that the D™), m running 
through all integral values, constitute a complete system of 
irreducible unitary representations of Dz. This can be seen as 
follows. 

Any irreducible representation is necessarily 1-dimensional : 
it associates with the rotation ġ a number y(¢) of absolute value 


1 such that 
xl + p’) = x(¢) ` x(¢’)- 


We assume that our representation is continuous; then x(¢) 
is a continuous function of ¢ with period 27. First, y(0) = 1. 
We write x(d) = e°? and determine A(¢) uniquely by the require- 
ments that A(0) = 0 and that A(¢) shall be a continuous function 
of $. We then have 


Ap + £") = AG) + AP’), (8.3) 


for the right- and left-hand sides of this equation could at most 
differ by an integral multiple of 27, but as it is written both 
sides agree for ¢’=0 and vary continuously with 4’. (8.3) 
satisfies the condition A(0) = 0 and we obtain from it the further 


equations 
A(— $) =— AG), Alke) = h: Al), (8.4) 


where k is any integer. On replacing ¢ in the second of these 
equations by ¢/h we obtain 


a(f) = +). (8.5) 


It follows immediately from (8.4), (8.5) that for every rational 
number k/h (k, h integers) 


(Ed) = FAD. (8.6) 


In accordance with our assumptions À(27) is an integral multiple 
Imr of 2r. On setting ¢ = 2r in (8.6) we obtain the equation 
\(¢) = md for all ¢ which are rational fractions of 2a; the 
continuity requirement then allows us to assert its validity 
for all real values of the argument ¢. 
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The simple equation 


Dom) x D) — Plt m’) 
is here valid. 


Consider the function f(p) on the unit circle in the complex 
x plane. If the point p goes over into the point p’ under the 
rotation £, the function f goes into a function f’ which is defined 


by the equation 
| f'(p') = fip). 


The transition f—> f' is a linear correspondence in the œ-dimen- 
sional space of functions f(p) and is associated with the rotation 
e; this obviously defines an oo-dimensional representation of 
the rotation group Dg, which we denote by &. ẹę& is unitary if 
we take as the square of the absolute value of a “ vector ° f 
the integral of | f(p)|? with respect to the element of arc dp on 
the unit circle. The fact that any function (satisfying suitable 
conditions) on the unit circle can be developed in a Fourier 
series means that in the reduction of & into its irreducible com- 
ponents each of the 1-dimensional representations D(™) occurs 
once and only once. More precisely, this reduction is to be inter- 
preted with regard to the completeness relation. 


(b) The Group of Rotations in 3-dimensional Space 


We consider the functions f= f(P) on the unit sphere as 
the vectors of an oo-dimensional unitary space whose metric 


is given by fl {(P)|*dw ; dw is the surface element of the sphere 


over which the integration is to be extended. If the point P 
goes over into P’ = sP under the rotation s, the function f 
goes over into the function f’ defined by /(P’)=/(P). The 
surface harmonics Y, of degree 1 [cf. II, § 4] obviously span a 
(22 + 1)-dimensional sub-space R, which is invariant under the 
totality of transitions f— f’ induced in function space by the 
various elements s of the rotation group b = ds—here again we 
speak of this representation as §§. They are consequently the 
substratum of a certain representation D, of b which is induced 
in R, by d. On choosing a definite direction as that of the 
Z-axis we may, as in II, § 4, take the set 


yy) (m= 1,1—1,- g eet) 


as a basis for the surface harmonics of degree 1. We then have 
a unitary representation, and the sub-spaces ft, corresponding 
to the various values 0, 1, 2, + + - of l are mutually perpendicular 
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in the unitary sense (orthogonality properties of surface har- 
monics). Ð contains the 2-dimensional rotation group d,—e.g. 
as the sub-group of rotations about the z-axis. The structure of 
Y(™ shows that on restricting 6; to this sub-group d, the 
representation $Ð, is reduced into the 1-dimensional representa- 
tions D™ for which m=1,12—1,-+-+, —l The fact that 
any function on the unit sphere possesses a unique expansion 
in terms of surface harmonics means that on reducing & into 
its irreducible components each of the representations Ð, l = 0, 
l, 2, + + *, occurs exactly once. This reveals the true signifi- 
cance of surface harmonics; they are characterized by the 
fundamental symmetry properties here developed, and the 
solution of the potential equation in polar co-ordinates is merely 
an accidental approach to their theory. 

Rotations are orthogonal transformations of three variables 
x, y, % If we wish to include with the: proper rotations with 
determinant + 1 also the improper ones with determinant — 1 
—‘‘ augmented rotation group D’’’—this can be done by intro- 
ducing the reflection 


Tix = — xX, y= —¥, a og (8.7) 


in the origin. Its reiteration 12 is the identity, and it commutes 
with all rotations. The matrix corresponding to it in the 
representation defined by the surface harmonics of degree / is 
the (2l + 1)-dimensional matrix (— 1)’, for the surface harmonics 
of degree l are homogeneous polynomials of degree l in x, y, 2. 
We can thus obtain two representations DF, Dy of the aug- 
mented rotation group from the representation 2, of proper 
rotations; these two coincide with Ð, for proper rotations, 
but in the first the matrix associated with the reflection i is + 1 
whereas in the second itis — 1. We call this + 1 the signature 
of the representation. Hence in the oo-dimensional repre- 
sentation § of the augmented group dD’ each ®, occurs once 
with signature (— 1)!, but not with the opposite signature. 
Although we are not as yet in a position to prove it, the 
D (= 0, 1, 2, °°) constitute a complete system of in- 
equivalent irreducible (single-valued) representations of the 
rotation group 6, and the +, Dr together constitute such a 
system for the augmented rotation group D. 

Now consider the unitary function space of all functions 
f(P) in 3-dimensional space for which the integral |f|? over all 
space is finite. Let the representation induced in this space 
by rotations s, in which the transition from f to the transformed 
function f’ = sf is associated with s, be denoted by ©. Each 
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function f(P) can be expanded in a series of terms of the form 
¢(r)- Yı Choose a complete orthogonal system ¢,(7), da(r), + + - 
in the domain of functions d(r) of the radius r, in the sense of 
the equations 


(°F nlt)balr)dr = Sun 


The functions of the form ¢,(r) - Y, then constitute a (22 + 1)- 
dimensional sub-space Razı which is invariant under rotations 
and in which © induces the representation D, Different Raz 
are mutually unitary-orthogonal. Each ®, then appears in 
© infinitely often, its various occurrences being distinguished by 
the “radial quantum number” n. Consider the analysis of 
single electron spectra given in Chap. II, § 5, in the light of these 
mathematical developments. We then see that the azimuthal 
quantum number l is of purely group-theoretic significance, 
whereas the radial quantum number refers to the dynamical 
situation, for the manner in which the orthogonal system ¢,(r) 
is to be chosen is determined by the dynamical differential 
equation. | 

The proper rotations of 3-dimensional Euclidean space about 
the origin of Cartesian co-ordinates x, y, z, i.e. the real orthog- 
onal transformations with determinant + 1, are most easily 
represented by a stereographic projection of the unit sphere 
about the origin on to the equatorial plane z = 0, the south pole 
of the sphere being the centre of projection. If the point 
(x, y’, 0) be the image on the plane of the point (x, y, z) on the 
sphere and we write { = x’ + iy’, the formule for the projection 


are 
PERSE 2 ae ee eee ae oP 


But it is preferable to introduce the two homogeneous complex 
co-ordinates £, y in place of č by means of the equation £ = {£ ; 
the south pole €:n = 0:1 is then included. We then have 


x+ iy:ix—iy: z : 1 = 
mE : 2H : EE — yy: EE + ni. 
Accordingly each unitary transformation 
o:f = of + Bn, = yé H èn 


of the co-ordinates £, » corresponds to a rotation s of the sphere, 
the points of which are represented by the rays é: y of 2-dimen-~ 
sional unitary space. Since, as is readily seen, any point and 
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tangential direction through it on the sphere can be carried 
over into any other such configuration on the sphere by means 
of such rotations, we obtain in this way all rotations. Since 
we are only concerned with the ratios of the coefficients «, £, 
y, 6, the arbitrary factor of proportionality may be chosen in 
such a way that the determinant of the transformation is 1. 
Nevertheless this normalization is somewhat artificial as the 
correspondence is still double-valued, for on multiplying the 
coefficients of the unitary transformation by — 1, i.e. on going 
over from o to — o, the normalization is unaffected. Hence to 
each element oa, (7.4), of the unimodular unitary group u corre- 
sponds a rotation s:ø— s under which the co-ordinates 
x + ty, x — ty, z transform like 


dye, 26H, EF — ni, (8.8) 
or 


anit fi, y~ (08 — ñ), z~ é — nj. (8.9) 


(The symbol ~, which we occasionally employ, means that the 
expression on the left transforms like the one on the right.) 
We obtain in this way all rotations, each one exactly twice. 
The rotations about the z-axis are obtained from the ‘‘ principal 
transformations ” 


; ,_ 1 
E = e$, 1 = z 


of u. In fact, on setting € = e* = e(w) the angle of rotation 
about the z-axis is 6 = — 2w. In virtue of the correspondence 
o —> $ the rotations in 3-dimensions constitute a representation 
of the group u; and, conversely, the association s—> ø is a 
representation of the group D = D, of 3-dimensional rotations 
by u, although this representation is double-valued. In virtue 
of this correspondence s > o any representation U(o) of u yields 
a representation of b; (“ T process,” § 5); ©, may thus be thought 
of as a representation of Da, in which case we write it ®;, where 


j= 50. The (“even”) D; with integral 7 are single-valued, 


those with half-integral (i.e. half an odd integer) j are double- 
valued. On restricting the group Ð, to the sub-group b, of 
rotations about the z-axis ®, is reduced into the 27 + 1 one- 
dimensional representations D!™ (m=j,7—1,°°*+,— J). To 
show this we first note that the substratum of our representation 
D, consists of the monomials (7.2) 


ink o, 
zm) = L i+ k= 2j, i— k= 2m), 
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where m runs through the values j, dee ae, en j. The 
transformation induced on these variables by a rotation ¢ 
about the z-axis is accordingly 


x(m) -> e(— md) - x(m). 


The representation ø —> s of u is itself contained among the 
representations D, of u constructed above; it is, in fact, 9,. 
To show this we note that if (£, ), (€’, n’) be subjected to the 
same transformation o of u, then the determinant én’ — n£’, as 
well as ££ + nj, is invariant. Consequently (#, 7) transform co- 
grediently to (n’, — £’), or as (n, — £) ; hence 


x+ iyn, z—iy~— i, g~n. (8.10) 


The representations Ð; with integral j are identical with those 
obtained above as the representations induced on surface har- 
monics of order j, for each polynomial in x, y, z of degree j is, 
in virtue of (8.10), equivalent to a form of order 2j in £, ». 

If we wish to augment u = tta in a manner paralleling the 
augmentation of D =, by the improper rotation 7 (reflection 
in the origin) we must consider it as an abstract group rather 
than a group of linear transformations in two vatiables. Denote 
the element corresponding to 7 by « and the elements of the 
original u by ø as before. We define the augmented w as the 
totality of elements of the types ø and ww; « must naturally 
obey the multiplication laws 


o=o, w= th. 


GF and ©; are then those representations of w’ which coincide 
with Œ, for elements of the restricted group u and which as- 
sociate with the element ı the unit matrix + 1 and its negative 
— 1, respectively. The sign + is again called the signature. 
The representation ©F associates the augmented rotation group 
Dd’, with w. ) 


(c) The Lorentz Group 


Let the 3-dimensional Euclidean space be referred to homo- 
geneous projective co-ordinates x, (a = 0, 1, 2, 3) defined by 


The equation of the unit sphere is then 
— + xt + 8+ 22 = 0 (8.11) 
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and the formule for the stereographic projection considered 
above become 


xo = É + M, and 


1 _ > _ 8.12 
xa = =(39 — 7), X = €E — Fn (8.12) 


On subjecting é ņ to an arbitrary linear transformation o the 
Xa undergo a corresponding veal linear transformation s which 
leaves the equation (8.11) invariant. If the absolute value of 
the determinant of o is 1, we can readily show that the form 


— ete et 4 xk + x2 (8.13) 


is itself invariant under the corresponding s, and that the deter- 
minant of s is + 1. 

We now consider x) = ct, %1, Xa X as the co-ordinates of 
space-time; (8.11) is then the equation of the light-cone, the 
generators of which are the possible paths for a beam of light. 
In the restricted thedry of relativity normal co-ordinate systems 
for space-time are connected with each other by arbitrary 
Lorentz transformations, i.e. by any real linear transformation 
which leaves the form (8.13) invariant and which does not 
interchange past and future. Lorentz transformations con- 
stitute a group, the ‘‘ complete Lorentz group,” and this group 
describes the homogeneity of the 4-dimensional world. This 
group consists of ‘‘ positive ” and ‘‘ negative’’ transformations, 
1.e, transformations with determinants + l and — 1, respectively. 
The first constitute the ‘‘ restricted Lorentz group,” from which 
the complete group is obtained by introducing in addition the 
spatial reflection 


Xo > Xo Xa > — Xa (a =], 2, 3). (8.14) 


Under the restricted group right and left, as well as past and 
future, are fundamentally different. Since the expression for 
' Xa in (8.12) is positive definite, we may state the result obtained 
above in the form: any linear transformation of €, n, with deter- 
minant of absolute value 1, induces a positive Lorentz transforma- 
tion s in the x,. Transformations o which differ only by a factor 
e*4 of absolute value 1 give rise to the sames. The correspondence 
o —> s is naturally a representation. 

The question of whether every positive Lorentz transformation 
s can be obtained in this way arises immediately. That this 
is in fact the case can be seen from general continuity con- 
siderations, for the positive Lorentz transformations constitute 
a single connected continuum. But it is also easily proved by 
elementary methods. Since we have seen in (b) above that the 
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rotations of space s are obtained from the unitary transforma- 
tions o, we need only to examine the Lorentz transformation 


1 
(Xo + Xa) > a (xo + x3), (£o — Xz) —> zalo — %3), 
xy —_ Xis Xo -> Xo, 


affecting the time axis, where a is a real non-vanishing constant. 
But this transformation is obtained from the unimodular oc: 


l 
E> aé, n> z 


Returning to the general case, the correspondence s—> o is a 
2dimensional representation of the restricted Lorentz group. 
But o is determined by s only to within the arbitrary “ gauge 
factor’ e"; we may therefore normalize it by the condition 
that the determinant of o shall itself be unity, not merely its 
absolute value. Even so, o remains double-valued, for — o 
satisfies the normalizing condition as well as o. This repre- 
sentation $-» a contains the representation of the rotation 
group considered in (b) on allowing s to run through the sub- 
group of spatial rotations contained in the restricted Lorentz 
group. 
The expressions (8.12) are Hermitian forms with matrices 


‘ re 5 nf? 0 —i 
e oa S o i 0 0 —1 


Hence if g denotes the onc-columned matrix with elements £, 7 
equations (8.12) may be written 
Xa = T SaL. (8.16) 
On replacing £, 7 by 7, — € the xa undergo the spatial re- 
flection (8.14). That is ome way of including the negative 
Lorentz transformations. But if we require that the corre- 
sponding transformation of £, y be linear, we must introduce in 
addition to g = (É, n) a second pair zr’ = (€’, n’) which undergoes 
the transformation of contragredient to 3. Then 
(ñ, = E)~(€, 7’) to within the factor d, 
(y, -~ €)~(€’, 7’) to within the factor d, 
where d is the determinant of o. Defining 
Siow Sy Shwe — S, (a = 1, 2, 3), 
the quantities 


} 2 — 


(8.15) 


! 3 = 


? nd a at 
x, = USAT 
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undergo the same transformation s as (8.16), provided the 
absolute value of the determinant of o is 1. The same is true 
for any linear combination of the two, e.g. x, + xa. Hence the 
quantities 


ta = ESE+ESi O (8.17) 


undergo the given positive Lorentz transformation s when é, 7 
are subjected to a certain transformation o and simultaneously 
é’, ņ' to the transformation o’ contragredient toc. Furthermore, 
they undergo the transformation (8.14) on interchanging the two 
pairs r, t’, ie. on subjecting the four variables to the trans- 
formation 


T:E, non; EE, 7. (8.18) 
The expression 
EE + ain! 
is invariant in virtue of the transformation law of Z’, y’ defined 
above. To obtain an expression which is also invariant under 


the interchange (8.18) we must add to the above the expression 
obtained from it by this interchange : 


(EE + m) + (EE + F'n). (8.19) 


It will be found advantageous to denote the column con- 
sisting of the four elements (E, 7; €', 7’) by a single letter x. 
Let that linear transformation of these four variables which 
transforms é, 7 in accordance with S, and €’, 7’ in accordance 
with Si, be denoted simply by S,: (8-17) then becomes 


Xa = T Sa T. (8.16’) 


We must now ask to what extent the linear transformation o 
of the four variables g is determined by the requirement that 
it induce a given (positive or negative) Lorentz transformation 
s of the Hermitian forms x,. It suffices for this purpose to 
inquire what transformations of the g induce the identity on 
the variables x, The only transformations of this latter kind 
are those which multiply £, 7 with a common factor e°* of absolute 
value 1 and at the same time &’, 7’ with any factor e?” (inde- 
pendent of the first) of absolute value 1. But o can be more 
precisely specified by the requirement that (8.19), ie. Tz, be 
also invariant. The two arbitrary “gauge factors” e4, e™ 
must then coincide: the substitution o is then determined to 
within a factor e’, | 

Our analysis reduces the problem of the representations of 
the Lorentz group to the corresponding problem for the uni- 
modular linear group Cy. 


4 
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§ 9. Character of a Representation 


The trace of a linear correspondence A, i.e. the sum of the 
elements in the principal diagonal of the matrix A, is an in- 
variant under transformations of co-ordinates which is of 
particular importance. The trace x(s) of the correspondence 
U(s) associated with the element s of the group g in a repre- 
sentation © of g is called the group characteristic, or, in 
order to avoid assigning yet another meaning to this second 
word, which has already appeared in another important con- 
nection in quantum mechanics, simply the character of the 
representation §. Equivalent representations have the same 
character ; the name is so chosen because the converse of this 
theorem is true within wide limits. Since U(I) = 1, the value 
of the character x(I) for the unit element is equal to the dimen- 
sionality of the representation. 

It follows from the equations 


U(asa~1) = U(a)U(s) U(a“) = U(a)U(s)U-*(a) 


that the matrices U(s) and U(asa~4) differ only in their orienta- 
tion and consequently have the same trace: 


x(asa-*) = x(s). 


Now s and asa! are any two conjugate elements of the group g, 
i.e. they belong to the same class of conjugates in the sense of 
§ 3. We speak of a function f(s) on the group manifold which 
has the same value for all elements s belonging to the same 
class as a class function ; such a function can at most allow us 
to distinguish between different classes, but not between ele- 
ments of the same class. The distinguishing feature of class 
functions can also be expressed in the equation 


fist) = f(ts). 
The validity of this equation for f= x follows from 
U(st) = U(s)U(t), U(ts) = U(t)U(s) 
and the fact that the trace of the matrix AB is equal to the 
trace of BA. 


The character x/(s) of a unitary representation : U(s~1)=U*(s), 
satisfies the equation 
x(s*) = x(s). (9.1) 


We shall say that the chdracters of irreducible representations 
are primitive. Any unitary representation Q can be reduced 
into its irreducible components, and the normal co-ordinate 
system in the corresponding sub-spaces can be so chosen that 
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two irreducible constituents are equal if they are equivalent. 
If in this sense 
H =m] +m F 4., (9.2) 
where þ, Ņ', - > - are inequivalent irreducible representations and 
m, m +++ are the numbers of times they occur in §, then the 
character X of H is expressed in terms of the characters y, y’, =° 
of , h’, - - - by the equation 
X(s) = mx(s) + m’x’(s) + es. (9.3) 

From an n-dimensional representation §:s— U(s), with 
the character y(s), and an xn’-dimensional §’:s-—> U'(s) of 
character y’(s) we can construct the (#n’)-dimensional repre- 
sentation ) X ^. The elements in the principal diagonal of 
U(s) x U'(s) are obtained by multiplying all elements in the 
principal diagonal of U(s) by those in the principal diagonal 
of U'(s): the character of H x ' is consequently x(s) x'(s). Again, 
if H is a representation of the group g, ’ a representation of 
the group q’, then the representation © X 9’ of g x g' has the 
character ¢ defined by 

t(s, s’) = x(s) x"(s‘), (9.4) 
where s runs through the elements of g and s’ those of g’. 

We need not distinguish between. a 1-dimensional repre- 
sentation and its character; the character satisfies the simple 
equation (4.2). This holds, for example, for the characters 
e(md), eq. (8.2), of the rotation group Dp. 

By the theorem on the transformation of unitary correspond- 
ences to principal axes, each element of the group wu = u is 
conjugate to a principal element, i.e. an element of the form 


e 0 
= l 9.5 
> I jel (9.5) 
€ 


The characteristic values €, 1/e are determined to within the 
order in which they appear. Introducing the angle w by the 
equation € = e(w), w characterizes a class of conjugate elements 
of u; we are only concerned with w mod. 2r, and furthermore 
the class — œw coincides with the class w. Since for any re- 
presentation © of u the character x(s) depends only on the class 
of the element s, it suffices to calculate it for elements of the 
form (9.5). It must be a periodic function of the angle w with 
period 27, and it must furthermore be an even function of w ; 
its value for Gy is 
efti — e—(f"1) 
i a . -+ ef = ——_—_.—_ 


— (9.6) 
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The characters of the representations considered in the 
other examples of the preceding section are just as readily 
calculated. 


§ 10. Schur’s Lemma and Burnside’s Theorem 


Lemma (10.1).5 Assumption. Let 2 be an irreducible system 
of linear correspondences of an m-dimensional vector space 1 
on to itself, and Q such a system of an n-dimensional vector 
space 8. A linear correspondence A shall satisfy the equation 


ZA = AQ (10.2) 


in the following double sense: for each U of 2 there shall exist 
a V of Q such that 


UA = AV, (10.3) 


and conversely for each V of Q there shall exist a U of Æ such 
that this relation is fulfilled. 

Assertion, Either A = 0 or m = n and det A +0; in the 
latter case Z and Q are equivalent. 

Proof. We first make use of the assumption that 2 is 
irreducible in connection with equation (10.2) in the first sense. 
Considering the kt* column 


Aik, Box, ° * ° Amk 
of A as a vector al), equation (10.3) asserts that the vector 
Uat*) associated with a‘) through the correspondence U is 
a linear combination of the vectors alk), specifically that 


Ual) = 2Unk a), y = | lon |. 


Consequently the sub-space of t spanned by the n vectors al) 
is invariant under 2, But because of the assumption that 2 
is irreducible either a) = 0, A = 0, or the a‘*) span the entire 
space t, in which case m of them are linearly independent ; 
this latter is possible only if n 2 m. That our conclusion 
contains two possibilities is due to the fact that the concept 
of irreducibility contains such an alternative. 

The second part of the assumption can be given a simple 
geometrical interpretation on going over to the transposed 
matrices: §2* is irreducible and for each V* of Q* there exists 
a U* of &* such that 


V*A* = A*U*. 
The reasoning employed in the first part of the theorem allows 


us to conclude: either A* = 0 or m2n. We summarize the 
results thus far obtained in the statement: Either A = 0 or 
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m =n; in the latter case the m = n columns al) of A are 
linearly independent, i.e. the determinant of A does not vanish. 
But then U and V are determined uniquely by the relation 
(10.3) and 2 and Q are equivalent. 

In formulating these results it is desirable to consider the 
case of equivalence separately : 

I. If the two irreducible systems &, Q are inequivalent, (10.2) 
can only be satisfied by A = 0. 

II. If X is an irreducible system a correspondence A commutes 
with all correspondences U of the system £: 


UA = AU (10.4) 


if and only if A is a multiple of the unit matrix 1. 

Assertion II follows from the lemma proved above by 
elementary methods and the fundamental theorem of algebra. 
For by the latter there exists a number « such that 
det (A — al) = 0, and since A’ = A — a1 satisfies (10-4) for 


all U if A does, we conclude that since det 4’ = 0 we must. 


have A’ = 0. 

Applied to representations, our results are : 

Fundamental Theorem (10.5). I. If s—> U(s), s+ V(s) are 
two inequivalent irreducible representations of a group Q, the 


_ equation 
U(s)A = AV (s) 


can be satisfied by no matrix A which ts independent of s, except 
A = 0. 

Il. A matrix A which is independent of s and which satisfies 
the equation 


U(s)A = AU(s) 


for all s is necessarily a multiple of the unit matrix 1. 

If there exists a matrix A which satisfies U(s)A = AU(s) 
identically in s and which is not merely a multiple of the unit 
matrix 1, the argument employed above supplies us with a 
constructive process for the reduction of the representation 
s —> U(s) with the aid of A. 

We now consider an application of these important results, 
which are fundamental for the entire theory of representations, 
in order to prove a theorem due to Burnside. Let X be a 
multiplicative system, i.e. if U, U’ are two correspondences in 
& then the product UU’ is also a correspondence in £. This 
concept is somewhat wider than that of a group; we need not 
require that U possess an inverse—its determinant may be 0. 

Burnside’s Theorem (10.6).6 In an irreducible multiplicative 


<< oer = 
+ 
1 = * 
1 
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system & of linear correspondences U = | | 2esx|| of an n dimensional 
vector space on to itself the components U,, are linearly independent, 
This asserts that the only matrix L which satisfies the equation 


tr (UL) = 2 bas Uir = 0 


for all matrices U of the system is L= 0. Contrary to the 
assertion, we assume there exist non-vanishing matrices satis- 
fying this equation; such matrices we shall call L-matrices, 
It is of course possible that every L-matrix whose first column 


by, ly, a R Ena 


vanishes must itself vanish. But in any case we can find a 
definite column index h with the following properties : there 
exist non-vanishing L-matrices whose first k — 1 columns 
vanish and are such that if the h™ column also vanishes then 
necessarily L = 0. We shall call L-matrices whose first h — 1 
columns vanish special L-mairices. They constitute a linear 
family of m <n dimensions ; we denote a basis for this family 
by 
LY, L,- Lm), 
The }4* column of a special L-matrix will be written I. 
Since X is multiplicative the equation 


tr (U'UL) = 0 


is satisfied by each L-matrix, where U, U’ are arbitrary corre- 
spondences of the system 2, With L, UL is also an L-matrix ; 
obviously it is a special L-matrix if Lis. Each of the matrices 


ULM, UL@,-++ UL» 


is therefore a linear combination of L(@),- - -, L() and each of 
the vectors Ul), ---, Ul™ is a linear combination of the 
vectors I), +--+, I), Accordingly the vectors 10), + + +, I0) 
span a non-vanishing sub-space which is invariant under all the 
correspondences U, and in consequence of the irreducibility 
assumed above it follows that m = n and the vectors [@), + > -, 
I(*) span the entire n-dimensional space. The basis L®), +., Lœ) 
of the family of special L-matrices can be chosen in such a way 
that I), - - -, [() are the fundamental vectors of the space ; 
I) is then the column (1, 0, 0,+--, 0), etc. Since then 


Ot) = u + ee tpp Tm) (10.7) 
we must also have 


ULM) = uy, LO ++ +++ ttp Lm), (10.8) 
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We now consider an arbitrary column, say the A, of L. 
(This is of course of no interest if k < h, for the first h — 1 
columns vanish.) Suppressing the second index k, we now let 
I = (ly +++, In) denote the k column of L. Then in accordance 
with (10.8), equation (10.7) holds for the present [, i.e. the t 
instead of the h column of L. Introducing for the moment the 


matrix 
12)... yom 
Az=||....... 
1 
(0) - m 
a © > h a 
consisting of the k columns of Li), ---, L™, we may write 


(10.7) as the matrix equation 
UA = AU. 


But it follows from this that A must be a multiple of the unit 
matrix, i.e. 


UD EE = C=), 

i i Ò; 0 (r + 1) ’ 

or, returning to the original notation by adding the column 
index k, 


I) = Àx ° of. 


Here we have, by the foregoing, A, = * > «= A = 0, 4 = h. 
The equation 
tr (UL) = 0 
becomes 


n 
Z Unrry = 0, (r= 1, n. n), (10.9) 


ie. all correspondences of the system 2* carry the vector À 
with components (Ay, As, °° *, A») over into the null-vector. 
In consequence of the irreducibility of Z this vector must there- 
fore vanish, which is in contradiction with the equation 4, = 1; 
Burnside’s theorem then follows by reductio ad absurdum.—lKt 
we know that the unit matrix is contained in the system 2, as 
is the case for a representation, we can conclude that A; = 0 by 
taking U in (10.9) as the unit matrix. | 

Reducibility requires that on employing an appropriate 
co-ordinate system all matrices U of the system 2 have an 
entire rectangle of vanishing elements and consequently implies 
a system of homogeneous linear relations between the components 
Ux of a very special kind. Burnside’s theorem states that if 
there exists no system of homogeneous linear relations of this 
special kind, then there exists no linear dependence at all. The 
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real reason for this remarkable fact is of course to be found in 
the assumption that 2 is closed with respect to multiplication. 

If our system 2 consists of an irreducible representation 
which associates with the elements s of the group g the matrix 
U(s), we see from Burnside’s theorem that the components of 
U(s) are linearly independent. The method developed above 
can readily be extended to prove the same for the components 
of two or more inequivalent irreducible representations U (s), 
U'(s), +- «7 From this it follows that in particular there can 
exist no linear dependences between their characters x(s), x'(s), aoe 
Any unitary representation can be reduced into irreducible 
components; the character of H is expressed in terms of the 
characters of these irreducible representations by (9.3). Since 
x(s), x'(s) are linearly independent the coefficients m, m’, wake 
which give the number of times the irreducible representations 
b, 9’, + + + appear in Q, are uniquely determined. This con- 
stitutes a new indirect proof of the following result, which has 
already been proved in § 6 in a more general and more elementary 
way: The irreducible representations into which can be reduced, 
as well as the number of times they occur, are uniquely determined 
by 9, no distinction being made between equivalent representations. 
Two unitary representations ), and , are obviously equivalent 
if every irreducible representation which is contained in the one 
is contained in the other the same number of times. Hence 
if Q, and Q, are inequivalent the character of H, cannot be the 
same as the character of Q, because of the linear independence 
of the primitive characters: a unitary representation is uniquely 
determined by its character alone, and its character may be used 
as a unique name for the representation itself. We here go no 
further into these extensions of Burnside’s theorem, which are 
due to Frobenius and I. Schur, as we shall obtain the same results 
by a more profound method in the next section under assump- 
tions which are more restrictive but which are sufficient for 
our purposes. 

We mention only one consequence. §, 9’ being representa- 
tions of the groups g, g’, respectively, then x §’ is an irreducible 
representation of g xg’. Indeed, there can exist no homo- 
geneous linear relation with constant coefficients Crk; uw between 
the components u4,(s)ui,(s’) of U ($) X U"(s’) except the trivial 
one ¢=0. For on applying Burnside’s theorem for the 
irreducible system § we have 

PH Cik cx Uy. (s’) = 0, 


t,x 


and on applying it again for §’ we must have Ciky we = 0. 
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§ 11. Orthogonality Properties of Group Characters 


If the abstract group g 1s finite, then any representation 
H:s— U(s) is equivalent to a unitary one. To show this take 
any positive definite Hermitian form, e.g. the unit form, subject 
it to all transformations U(s) of and sum over s, We thus 
obtain a positive definite Hermitian form H which is invariant 
under each of the transformations U(s). Now choose the co- 
ordinate system in such a way that H becomes the unit form ; 
then U(s), expressed in terms of these co-ordinates, is unitary. 
This same method of summation over the elements of the group 
gives rise to the fundamental orthogonality relations. 

Let 0:5— U(s), H :s— U'(s) be two inequivalent irre- 
ducible representations of the finite group g, the former being 
g-dimensional and the latter g’-dimensional. We write 


U(s) = |[usx(s)|], U"(s) = ||mie(s)]|, 
U's) = ||ai(s)||.- 
For a unitary representation H’ 
tels) = H,,(S). 


Kt 


If A is an arbitrary matrix with g rows and g’ columns then 
obviously the sum 
SUAU) = B, (11.1) 
t 
taken over all elements t of g, is invariant in the sense that 
U(s)BU'-\(s) = B. (11.2) 
In fact, the left-hand side of (11.2) becomes, in virtue of the 
fact that s -> U (s) is'a representation of g, 
Z U(r) AU), 
where r = st, s being fixed and ż running through all elements 
of the group. We therefore obtain equation (11.2) or 
U(s)B = BU"(s). 
In accordance with the fundamental theorem (10.5) it follows 
from this that B = 0, i.e. 


2 P2 Uini Apt, lt) = 0. 


Writing s in place of t and remembering that the a,, are arbitrary 
numbers, we obtain the g? g? equations 


SUix(S)te(S) = 0, 
or, in dealing with unitary representations, 
Suir) ils) = 0. (11.3) 
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Taking the single irreducible representation s—> U(s) in- 
stead of the two inequivalent representations Q, 9’, we find by 
the same argument that the square matrix 


U(s)AU-*(s) = B, 
found from an arbitrary square matrix 4, must satisfy the 
U(s)B = BU(s). 


This requires, however, that B be a multiple of the unit matrix 1, 


i.e. 
F E wins) ve Šal) = a Sa 


the number ~ depends on the matrix A, the dependence being 
of course linear and homogeneous. Taking as 4A that matrix 
which has as its only non-vanishing element a,, = 1, we obtain 


the equation | 
Z tirls) hals) = Opn Òi- (11.4) 
t,.(S)|| is the matrix reciprocal to IAI . 
Z ei (S) tils) = 5... 


On taking u=7 in (11.4) and summing over i = 1, 2,» > +, g 
we find that 


Now | 


h: Òx = Ekk, 
where k is the order of the group g. 
Expressing the sum 3’ in terms of the mean value M = ` a 
our results may be written in the form 
1 for t = ı k =x 
Mw ..(s)z.(s)} = 4 2 oo (11.5) 
0 otherwise 


for any irreducible unitary representation 9: s > U(s) and 
Mf 1u(s) i, (s)} = 0 (11.6) 


for any two inequivalent irreducible unitary representations 
s—> U(s), s— U'(s). The components of one or more inequivalent 
irreducible unitary representations constitute a unitary-orthogonal 
set of functions on the group manifold. 

It follows from these fundamental orthogonality relations 
that the components u,,(s), ui(s), + + + are linearly independent. 
Since the number of linearly independent functions of an argu- 
ment s which assumes but % values cannot be greater than k 
we must have 


git+g?+- -° Sh, 
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On the left-hand side of this equation occur the squares of the 
degrees of any inequivalent irreducible representation of g. 

We obtain the orthogonality properties of the characters 
by writing k=1, «=e in (11.5), (11.6) and summing over 
these indices : 

Any primitive character satisfies the equation 


Mix(s)x(s)} = 1, (11.7) 


and the characters x(s), x (s) of any two inequivalent irreducible 
representations satisfy 


Mix'(s)x(s)} = 0. (11.7’) 
The primitive characters of inequivalent representations constitute 
a normal orthogonal set of functions. They are consequently 
linearly independent, and from this follow all the consequences 
discussed in the previous section. In particular, a representation 
of g can be unambiguously described by its character, no dis- 
tinction being made between equivalent representations. The 


number of times m the irreducible y occurs in the representation 
X is, following (9.3), given by 


m = MiX(s)x(s)}, (11.8) 
and we have 
WIX(s)X(S)} = m + m? +.. 
This last equation offers a simple criterion for the irreducibility 
of a given representation in terms of its character y: it is neces- 
sary and sufficient that the mean value of xX = |x|*—which is in 
any case integral—be unity. 

Since the characters are class functions we are in dealing 
with them concerned with an argument which runs through 
the K different classes of g; there can therefore be no more 
than K linearly independent class functions. Hence a finite 
group can have no more inequivalent irreducible representations 
than classes. 

Whereas the general concept of a representation seemed at 
first to open up limitless possibilities, we now see that all 
representations are constructed from primitive ones and that 
the number of possible primitive representations is confined 
within narrow limits. The further content of the general theory 
of representations can be stated in the theorem that the sets of 
functions, the orthogonality of which we have shown above, are 
complete orthogonal systems. The primitive characters con- 
stitute a complete orthogonal system in the domain of class 
functions, i.e. there exist exactly K inequivalent irreducible 
representations. The components of a complete system of K 
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inequivalent irreducible representations constitute a complete 
orthogonal system for the totality of functions defined on the 
group manifold, or 

Aa git ptpeee 
where the sum on the right is extended over such a complete 
system and g, g’, + : - are the dimensionalities of the individual 
irreducible representations. 


§ 12. Extension to Closed Continuous Groups 


The theory developed in the preceding sections cannot be 
extended to arbitrary groups, but it is applicable mutatis 
mutandis to a group whose elements constitute a continuous 
closed manifold of a finite number of dimensions. Just as the 
immediate neighbourhood of a point on a surface constitutes 
a plane, so the immediate neighbourhood of a point p, on an 
r-dimensional continuous manifold constitutes an r-dimensional 
linear manifold and the line elements from p to neighbouring 
points p define an r-dimensional linear vector space. We 
assume that the infinitesimal elements of our group g (i.e. those 
clements in the neighbourhood of the unit element 1), or rather 
the infinitesimal vectors leading to them from I, constitute 
such an r-dimensional vector space, the “tangential space” 
to q at I. The concept of an infinitesimal rotation will be 
familiar to the reader from the kinematics of rigid bodies, as 
well as the fact that these infinitesimal rotations in 3-dimen- 
sional space constitute a 3-dimensional linear family—in 2-dimen- 
sional space an [n(n — 1)/2]-dimensional family. The multiplica- 
tion of two infinitesimal elements of the group is then expressed 
by the addition of the corresponding vectorial line elements in 
the tangential space. 

A parallelepiped which will serve as a volume element in 
the neighbourhood of I is defined by r linearly independent 
line elements, and its volume is given as usual by the absolute 
value of the determinant of the components of these r vectors, 
This volume element is, of course, not entirely independent of 
the choice of a co-ordinate system in the tangential space, but 
the transformation to a new co-ordinate system only multiplies 
the volumes of all such elemental volumes in the neighbourhood 
of l by a constant numerical factor, These volumes are there- 
fore determined to within the choice of a unit of measure; more 
than this we can hardly require. 

On extending the theory developed in the preceding section 
te continuous groups integration replaces su nmation, and it is 
therefore necessary to be able to measure volumes on the entire 
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group manifold of g. With the aid of the foregoing volume 
elements in the neighbourhood of Į can be measured and com- 
pared immediately with each other, and the same is true for 
the volume elements at any other point of the group manifold. 
The only difficulty lies in carrying the unit of volume from the 
point | to any other point a. Examination of the argument 
of § 1] reveals that the measurement of volume must have the 
following invariantive properties: the volume of an arbitrary 
element must be unaltered by a left-translation of the group 
manifold which transforms the general element ¢ into r = at. 
But this requirement just suffices to specify the process uniquely. 
Consider the volume element at a which arises from an elemental 
volume at I by the left-translation which throws | into a; per 
definittonem the volumes of these two elements shall be the same. 
On carrying the volume element from a to b by means of the 
translation ¢’ = (ba™)t the equation ¢’ = b(a™tt) shows that with 
this definition of volume the volumes of the elements so obtained 
at a and b are equal. 

We further assume that our continuous group manifold is 
closed—in the sense, for example, that the surface of a sphere 
is a closed manifold in contrast with a Euclidean plane, which 
is open. This guarantees that we shall be able to integrate 
continuous functions of position on the group manifold over the 
entire manifold. We now choose the unit of volume in such a 
way that the volume of the entire manifold g is 1; the integrals 
are then mean values. We naturally require that the components 
of U(s) in a representation s —> U(s) are continuous functions 
of the element s of g. The laws (11.5), (11.6), (11.7), (11.7’) 
and all consequences obtained from them in §11 are then valid 
for irreducible representations of the continuous group g and their 
characters.§ 

The theory would be extraordinarily restricted if the measure 
of volume, which we have introduced in such a way that it is 
invariant under left-translations, were not automatically invariant 
under (1) right-handed translations: s -> s' = sa. and (2) inversion: — 
s—> s' = Ss}. The first of these properties will be established 
by showing that the volume of a volume element at J is unchanged 
on taking it to a by a left-translation and returning it to I by a 
right-translation. Obviously each infinitesimal element ðs of 
the group then undergoes the linear transformation A: 


ds —> 6's = a: ÒS t aun}, 


i.e. the conjugation f, associated with the element æ. Such 
linear transformations in the 7-dimensional vector-space of the 
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infinitesimal elements of the group constitute a representation 
a—» A of the abstract group g. Since g is closed, each A must be 
“ absolute-unimodular,”’ i.e. the determinant of A must have the 
absolute value 1; and this in turn allows us to conclude that 
the definition of transportation of volumes by either left- or 
right-translations leads to the same result. To prove this 
consider the element æ and its powers až, a®, +--+. Since the 
group manifold g is closed, the infinite set a, a?, a®, +++ ong 
possesses a point of condensation b, i.e. an infinite set of ex- 
ponents » can be found such that as n runs through this set 
a* converges to b. To the elements a” and b correspond the 
conjugations A" and B, respectively, and in virtue of the con- 
tinuity assumed above det (A”) converges to det (B) as n runs 
through the chosen set. Now since det(B) is a finite non- 
vanishing number, and since, if the absolute value of the deter- 
minant of A differed from 1, det (A") would tend toward 0 or œ, 
we may conclude the truth of the above assertion. This also 
enables us to prove the truth of (2), invariance under inversion. 
For inversion sends the element 6s at I into — ds, and this 
transformation is absolute-unimodular. Now send one of two 
inverse volume elements at I to a by a left-translation and 
the other to a`?! by a right-translation ; we thus obtain volume 
elements at a and a~! which go into each other by the inversion 
s= s == 5-1, Since both left- and right-translations conserve 
volumes, these two volume elements have the same volume. 


Examples of the Orthogonality Properties 


We have already found the primitive characters for the 
group of rotations d, of a circle into itself: e(md), m = 0, +1, 
+ 2,:+:-+ » where ¢ is the angle of rotation. They constitute, 
in fact, a unitary-orthogonal set of functions : 

‘ 2r (m = m’) 


fe(md) amg db = |p em 


If there existed further irreducible representations their char- 
acters would necessarily be orthogonal to all of these; but this 
is impossible, for the functions e(md), where m takes on all 
integral values, already constitute a complete orthogonal 
system. We have, however, already shown by a more direct 
method (§ 8), which did not involve Parseval’s equation, that 
the system of primitive characters e(m¢d) was complete. It is 
therefore natural to consider Parseval’s equation as the simplest 
case of the general group-theoretic completeness theorem men- 
tioned in § 11. . 
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The character of the representation ©, of the 2-dimensional 
unitary unimodular group u = U, is given by (9.6). Writing 


e=ew), A=e—e l= 2 sina, =~ Ado = do, 


we have 


| Xr xo do = f A o, (12.1) 


This leads us to suspect that de is the volume of that portion 
of the group manifold occupied by those elements o of the group 
whose angles of rotation lie between w and w -+ dw. [The 
total volume of the group manifold is then 


w = 0 


lp 


If this is correct, (12.1) are the orthogonality relations predicted 
by the general theory, and the equation 


do = ARdw 
Darr 


defines the density of the various classes of the group. In the 
last chapter we shall actually carry through the determination 
of volume and verify these results. 

If there were yet another irreducible representation, with 
character y, then é = A: xy would be an odd periodic function 
of w with period 27 which would be orthogonal to all the functions 
és = A-yy, iLe. to the functions 


sin w, sin 2w, sin 3w,:-:: 


But these latter are already a complete orthogonal set for 
the domain of odd periodic functions, and consequently the 
ŒC, (f= 0, 1, 2, ++ +) constitute a complete system of irreducible 
representations of the group u. A direct proof, which is inde- 
pendent of Parseval’s equation, is also to be found in Chap. V, 
§ 16—indeed, it is there carried through for u, in an arbitrary 
number n of dimensions. 
The Clebsch-Gordan series 


Xt Xo = Xtag F Xrag-2 bok X|f-9| (12.2) 


for the characters yy is readily verified. If we know on general 
grounds that the character of a representation specifies it uniquely, 
this equation can be used as a proof of the reducibility of ©, x ©, 
into irreducible components with characters as on the right. 
Since the characters are much more readily handled than the 
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representations themselves this principle offers a very powerful 
method for obtaining assertions concerning representations. 
Let f 2g and multiply equation (12.2), which is to be verified, 
by A: 


E= Who V=ftsftgs—%++5f—s). 
The product of 
&, = eftl— e-(ft1) with y, = 89 + em? +--+ +4 eng 
is the difference of two sums; the one is 
eftgth 4. eftg-k toe. et gfil-g 


the exponent decreasing by 2 from term to term, and the other 
is obtained from this one by replacing all exponents by their 
negative. Hence the product is in fact 


Sfert m e7 (2v!1)), v=f+g f+g—2 nf g. 


The representations G7, ŒF (f= 0, 1, 2, - - -) constitute a 
complete set of inequivalent irreducible representations of the 
augmented group Ua To establish this we first note that in an 
irreducible representation of w the matrix associated with the 
element ı must be a multiple of the unit matrix, for it commutes 
with the irreducible system of matrices constituting the repre- 
sentation. Furthermore, u = l, so this matrix can only be 
+ 1 or — 1. Since the matrix associated with « is a multiple 
of the unit matrix, and since the extension of u to w’ involves 
the addition of a single clement ¢, the representation must remain 
irreducible on restricting the group w to the sub-group u. Hence 
every irreducible representation of u, is obtained by supplement- 
ing the irreducible representations of t, by the association 


> +1 or = —1. 


If H, H run independently through complete systems of 
inequivalent irreducible representations of the two (finite or 
closed continuous) groups g, 9’, respectively, then the § X §’ 
constitute a complete system of inequivalent irreducible rep- 
resentations for the direct product g X g’. To prove this we 
note that since the primitive characters y(s) of g constitute a 
complete orthogonal system for class functions of the element s 
which runs through g and the primitive characters y'(s’) of g 
do the same for g’, the totality of the products x/(s) - x‘(s’) con- 
stitute a complete orthogonal system for the class functions of 
the element (s, 5”) which runs through the group g x g’. 

The representations Œy, , introduced in § 5 constitute a com- 
plete system of irreducible representations of c, when f, g run 
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independently through the numbers 0, 1, 2,°- 


-: we here 
only mention this fact without going further into it. 


§ 13. The Algebra of a Group 


We return for the present to finite groups. In order to be 
able to express the completeness theorem we associate with 
each function x(s) on the group manifold of the finite group g 
its “ Fourier coefficient matrix,” the group matrix. 


X = J'x(s)U(s), (13.1) 
where H :s-— U(s) is a representation of g. The trace of X, 
E = Lx(s)x(s) (13.2) 


is the Fourier coefficient of x(s) with respect to the character 
x(s) of ©. It is here desirable to consider the function x(s) as 
a Single quantity x in the group domain; each element s of the 
group is a dimension in “ group space” and the number x(s) 
is the s-component of the quantity x. We may express the 
quantities themselves symbolically in the form 


x = Six(s)°S. (13.3) 


The matrix X is associated with the quantity x in the repre- 
sentation Q: x > X in ©. Addition of “ group quantities’ and 
multiplication of them by a number are introduced in the usual 
way: x +y has the components x(s) + y(s) and ax the com- 
ponents a«:x(s). Group quantities consequently behave like 
vectors in an k-dimensional space, where k is the order of the 
group. The following definition of multiplication of two arbitrary 
group quantities x and y is suggested by (13.3) : 


z= xy = X x(i)y(t’)tt! = Les): 
t,t 8 
where 


s(s) = 5 x(d)y(). (13.4) 


LL! ws g 


This last equation, in which the sum is to be extended over all 
pairs of elements ¢, t’ whose product is s, defines the product z 
of the quantities x and y. We denote this product by xy and its 
components by xy(s); this is not to be confused with x(s) - y(s), 
the ordinary product of the two numbers x(s), y(s). Addition 
and multiplication of group quantities parallel addition and 
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multiplication of the group matrices associated with them by 
(13.1). Indeed, the product of 


X= xls) U(s) Y= Zyls) U(s) 


is given by 
Z= XY = Z «(tye U(ti’) = 2/2(s) U(s), 


where g(s) is defined by (13.4) 

The operations to which the group quantities may be sub- 
jected: (1) addition, (2) multiplication with a number, and (3) 
multiplication with one another, satisfy the usual laws of 
ordinary algebra with two important exceptions: multiplication 
is not commutative and division is not in general possible, i.e. the 
equation ax = b for given a +0 and b may have no unique 
solution or even no solution at all. But there does exist a 
quantity 1 having the properties of unity: 1a = a1 =a for 
every quantity a; its components all vanish with the exception 
of the one associated with s = l, which is 1. A domain of 
quantities as described above is called an algebra,’ and the 
“ group quantities ” are the elements of the algebra; care must 
be taken not to confuse these with the elements of the group 
(cf. V, §5). The association x-> X in the representation § 
satisfies the conditions : 


1. 1» 1, to the element 1 corresponds the unit matrix 1; 
2, if x > X, y — Y and « is a number, then 


X¥+yoX+Y, oxo aX, xyoXy. 


A representation © of the group is the same as a realization or 
“ representation ” of the algebra of the group by matrices such 
that these conditions are satisfied. Actually all we have done 
here is this: we have gone over from the matrices U(s) associ- 
ated with the individual elements of the group to the linear 
manifold of matrices for which they constitute a basis. 

What characterizes an element a of the algebra whose com- 
ponents a(s) define a class function? We have in general 


ax(s) = Za(st)x(t), xa(s) = Zalts)x(t-"), 


and a class function satisfies the equation 

a(st) = a(ts). 
Hence such an a is characterized by the fact that it commutes 
with all elements x ‘of the algebra: ax = xa. Employing a 


term carried over from group theory to algebra we may say: 
those elements whose components depend only on the class of 
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conjugate group elements to which the argument s belongs constitute 
the central of the algebra. 

We are interested only in unitary representations s —> U(s). 
For such a representation the Hermitian conjugate of (13.1) is 


X= Las) O(s) = LG(s)U(s) = LAs) U9). 


Hence on defining the conjugate xX of the element x by X(s)=(s7}), 
Hermitian conjugate matrices are associated with conjugate 
elements in a unitary representation ; this characterizes unitary 
representations. An element will be said to be real if it coin- 
cides with its conjugate. We have seen that the character 
x(s) of a unitary representation satisfies this condition 
x(s) = x(s~). 

Let © be a g-dimensional irreducible unitary representation 
of g. C= ||cizl| being a given g-dimensional matrix, the element 
c of the algebra defined by 


ce(s) = Des, ° a(s) = È tr [cÕ(s)] 
ik 


is such that c-> C in H; this is readily verified with the aid of 
the orthogonality relations. Hence in the correspondence x > X 
X runs through all g-dimensional matrices. We denote the 
quantity with components $ a(s) by e: The set H of all 
elements of the form 


D Cik Cik; 


where the coefficients c,, are arbitrary, is naturally closed with 
respect to the operations of addition and multiplication by a 
number. But the product of two elements in H is again an 
element in H; indeed, if cis in H and x is an arbitrary clement 
of the algebra both cx and xc are also in H. We express this 
situation in a terminology paralleling that of the theory of groups: 
H is an invariant sub-algebra of the algebra T of all group quantities. 
To prove these assertions we first note that the definition (13.1), 
together with the condition that s — U(s) be a representation 
‘yields the equation 


XU(s7}) = Xx) U (ts~1), 


or, on replacing U(s~) by O(s 


hn 


F 
XB(s) = LO (se)xI0. (13.5) 
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Multiplying on the left by C = ||c:x|| and constructing the trace 
we find 


Ë tr (CX) O(s)] = Jelsi aælt) = ¢x(s), 


whence y = ex isin H : 
cx = 2 Yir Cix (13.6) 


and the matrix 
[y| = CX. (13.7) 


In the same way we can show that if c belongs to H then xc 
does also. If 
x —> X = ||%,| in ġ 
we call 
L Xik Cte 


the component of x in H. In accordance with (13.6), (13.7) this 
component is the product of x with 


8 = eu + Cet: "+ ez; 


it is gx = xg. & is a real element belonging to the central of the 


group algebra J’, with components $ *x(s); it is “ idempotent,” 
i.e, it satisfies the equation ee =e. In particular, the product 
of two elements 


a= San.@x, b= Sone, 


of H with coefficient matrices A, B, is the quantity ab in H 
with the coefficient matrix AB. @ is the 1, the ‘ modulus,” or 
“ principal unit," of the sub-algebra H since ex = xe = x when 
x is in H. The algebra H is identical with the algebra of all 
g-dimensional matrices (‘‘ simple matric algebra’’), The “ units ” 
eix satisfy the equations 


Cir Ory == Cre, Cir Cy, = O forr +s. (13.8) 


The central of the sub-algebra H consists only of the multiples 
of its modulus e. 

An irreducible representation §’:s—> U'(s) = || vats) | of 
dimensionality g’ which is not equivalent to H yields another 
invariant sub-algebra H’ consisting of all elements of the form 


O = Dees | [Ce || = C 
4% 
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The components of e, are ads). It follows from the 
orthogonality relations existing between inequivalent repre- 
sentations that c’ — 0 in the representation ©. If c is in H, 
then, by applying (13.6) for x = c’, ce’ = y is also, but since 
then X = 0 (13.7) yields y=0; the two sub-algebras are 
independent in the sense that the product of an element in one 
with an element in the other is always 0. Hence the “ units ” 
satisfy 


Cik e — 0. (13.9) 
The modulus 
g = Ye’ 


of H’ satisfies ee’ = g's = 0 in addition to e’e’ = e’. 

If als) is a class function, a belongs to the central of I’ and 
if a-> A in the g-dimensional irreducible representation Q 
then the matrix A commutes with all matrices X. Hence A 


is a multiple of the unit matrix: A= z1. By (13.2) we find 
that the trace « of A is * 
x = Za(s)x(s). 


In this way the entire theory of representations can be 
translated into the language of modern algebra. This leads to 
a greater freedom of operation and is preferable for the expression 
of the completeness theorem. The orthogonality relations 
between u,,(s), wi(s), © + lead to Bessel’s inequality 


gtr (XX) ++ ++ Sh- Yx(s)a(s), (13.10) 


where X in the sum on the left is the matrix (13.1) associated 
with x(s) in the g-dimensional irreducible representation and 
the sum is taken over any set of inequivalent irreducible repre- 
sentations ©, +- This inequality is obtained by expressing 
the fact that the mean value of 2(s) 2(s) is non-negative (cf. I, § 7), 
where z is that element obtained from x on subtracting from x | 
its components in H, +=: 


Z=X— (Sliert =x aese 


Since the characters constitute an orthogonal system we also 
have the Bessel inequality 


EE +e ++ S he Sx(s)G(s) (13.11) 


* Cf, also Appendix 2 at the end of the book. 
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‘s defined by (13.2). The completeness theorem asserts 
i ss atk er the fers sign holds when the sum is extended 
over a complete system of inequivalent irreducible representations, 
where in (13.10) x(s) is any function on the group manifold and 
in (13.11) any class function. The second relation is a special 


case of the first, since for class funchons X = J 1. 


If the abstract group g is a finite continuous group which 
is closed in the sense of § 12, instead of a finite group as above, 
the sums must be replaced by integrals ; the measure of volume 
on the group manifold is introduced as in § 12. We then have 


in place of (13.1), (13.4) : 
X = [x(s)U(s)ds, 


ay(s) = |x(st-2)y(t)dt = [x(t)y(i-4s)dt. 


The modulus 1 of the algebra must have as components the 
values of a function 1(s) which vanishes everywhere on the 
group manifold except at the point s = 1 and must there be 


so large that fi(s)as = 1. Such a function does not exist, but 


we can construct functions approximating these conditions 
arbitrarily close. | 

The completeness relations assert that any element x of 
the algebra of a finite group g is the sum of its components in 
the totality of sub-algebras associated with a complete system 
of inequivalent irreducible representations. The group algebra 
T is thus reduced to a set of independent simple matric algebras. 
It suffices to prove this theorem for x = 1: 

ete He e= (en t'e eH ey) tee e, (13.13) 

for on multiplying this by x it follows for all elements x. These 
assertions cannot be carried over to continuous groups in the 
form here stated; we must hold to the formulation (13.10) 
(with = instead of <£) containing an arbitrary function x(s). 
We go into the proof of these results in Chap. V, where all 
the results of this section will be derived anew and discussed in 
detail from another more profound point of view. 


§ 14. Invariants and Covariants 


We first discuss briefly the classical concept of an invariant. 
Consider, for example, the group c = c of homogeneous linear 
transformations of two variables £ 7 with unit determinant, 


Let 
at? + 2bén + cn? 
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be an arbitrary quadratic form in the two variables. The 
“ discriminant ’’ ac — b? is an invariant, for the discriminants 
of two forms which are such that either goes into the other on 
transforming é, 7 by some element of c have the same value. 
We may have, instead of one arbitrary quadratic form, one or 
more arbitrary forms f, ¢, «+ + of given orders, n, v, >. An 
invariant is a rational integral function J of the coefficients of 
these forms which is homogeneous in the coefficients of each of 
the forms f, $, ++ and which has the same value on replacing 
these coefficients by the coefficients of the forms f’, ¢’, ++ + into 
which f, $, - + » are transformed by an arbitrary transformation 
a of c affecting the variables £, n. 

The coefficients a, a,, °° -, a, of an arbitrary form of order 
n in the variables é » undergo a certain linear transformation 
on subjecting the variables to a transformation ø of ¢, and the 
correspondence between o and this transformation constitutes 
a representation of the group ce. The same is true for the totality 
of monomials 


apat -o am gtr te st r=) 
of order r in these coefficients. A homogeneous polynomial 
I of order r in the a; is a linear combination of these monomials. 
We thus see that if J is of given degrees r, p, +++ in the coefficients 
of the arbitrary forms f, ¢, » + + it is a linear combination of 
quantities which constitute the substratum of a definite’ re- 
presentation of c; this representation is known as soon as we 
have given the orders n, v, ©- +- of the forms f, $, + ++ in the 
variables £, 7 and the degrees 7, p, - ++ of the invariant J in the 
arbitrary coefficients of f, $, ++». Discarding the all too special 
formal algebraic assumptions involved in the “ classical” 
concept of an invariant, and which the theory of invariants has 
from the beginning attempted to outgrow by generalizations in 
various directions, we may express the concept in modern 
group-theoretic language as follows : 

Let §:s-» U(s) be a given representation of an abstract group 
g in an n-dimensional representation space R with variables x; ; 
a linear form in the x, 1s said to be an invariant in the representation 
space R of Q if it is unchanged under all the transformations U(s). 
If I, Ie, ++ are invariants in the representation space of 9, 
then any linear combination a,J, + oJ, +° > of them with 
constant coefficients œi, a», °° * is also an invariant. The most 
important problem arising here is naturally that concerning the 
number m of linearly independent invariants in the given 
representation space. If yy, Ye °° * Ym constitute such a com- 
plete set of linearly independent invariants, and if we choose as 
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co-ordinates in R these m quantities and n — m further linear 
forms Yet) °° ‘> Yn Such that the two sets together constitute 
a complete system of linearly independent linear forms in R, 
the transformation U(s) is, in terms of the variables y, 


yi = Vy Vn = Ym; 
Yml = Umr: 1 (S) Yı H't E Umt n (S) Yn 


Ya — Uni (S) Yı +--+ Unn(S) Vn: 


If we are dealing with a unitary representation the y's can be so 
chosen that they define a normal co-ordinate system; § is 
then reduced into m times the 1-dimensional identical repre- 
sentation y =y and an (n — m)- dimensional representation. 
Hence the problem of finding the number of linearly independent 
invariants in the representation space R reduces to finding how 
often the identical representation with the character 1 is con- 
tained in the given $. But by formula (11.8) the solution of 
this problem is given by 


m = Mix(s)}, (14.1) 


or: the mean value of the character y of 9, which is always a 
non-negative integer, gives the number of linearly independent 
invariants in the representation space of $. 

The formula (14.1) answers the principal question arising 
in the linear invariant theory, and we now proceed to an ex- 
tremely brief discussion of the algebraic invariant theory. Let 
©, H, - + + be representations of the same abstract group g in 
the spaces with variables x,, y,, - °°. We consider rational 
integral functions I(%,, yz, * - +) which are homogeneous in the 
variables x; homogeneous in the variables y,, etc. If on sub- 
jecting x, y, ' > + to those linear transformations corresponding 
to the same arbitrary group element s in the representations 
©, H, -- + I remains unchanged, then it is said to be a rational 
integral invariant of the system [®, H, + + *] of representations. 
If the orders p, q, ++ * of the function J in the variables x,, yz, °° - 
are given, the problem reduces to the one discussed above ; 
for the monomials in these variables which are homogeneous 
of order p in the x; homogeneous of order qin the Yy, +- con- 
stitute the substratum of a representation obtained in a certain 
way from ©, Q, ---. But if we consider simultaneously in- 
variants of all possible orders belonging to the system [G, 9, «- °] 
we are confronted with new problems. The most important of 
these, which is answered in the affirmative by the so-called 


Spree ery ou o 
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fundamental theorem of the theory of invariants is: Do there 
exist a finite number of invariants such that all others can be 
expressed rationally and integrally in terms of them? This 
involves the question of algebraic, rather than linear, dependence 
between the invariants. We only mention this higher branch 
of the theory of invariants, and do not go into it further, as it 
bears no direct relation to quantum mechanics. !° 

In addition to invariants or scalars, covariant linear 
quantities, such as vectors and tensors, play an important 
rôle in physics. Let g be the group of all linear transformations 
between the normal co-ordinate systems in space or in space- 
time, e.g. the 3-dimensional group of Euclidean rotations or 
the group of Lorentz transformations, and let §:s—> U(s) be 
an n-dimensional representation of g. A covariant quantity of 
kind §$ is an entity having n components ay, as, ° * +, a, relative 
to any given co-ordinate system for the variables of the transforma- 
tion group g and which is such that on going over to a new Co- 
ordinate system by means of the transformation s of g the new 
components a; ave obtained from the old by the corresponding 
transformation U(s) of H. If © is irreducible such a quantity 
is said to be primitive or simple. Physical quantities are generally 
simple. Thus, for example, the entity whose components are 
the electro-magnetic field strengths in the 4-dimensional world 
is described as an “ anti-symmetric tensor of order 2” rather 
than merely as a “ tensor of order 2” ; we shall see in Chap. V, 
§ 4, that it is therefore a simple quantity. The reduction of 
a representation into its irreducible constituents implies the 
reduction of the corresponding kind of quantities into simple 
quantities. It would appear that the only simple quantities 
with which we deal are tensors which are characterized by 
certain symmetry conditions in addition to their order. We 
shall prove this theorem for the complete linear group ¢ and for 
its unitary sub-group u in Chap. V; it asserts that all repre- 
sentations of c (or u) can be obtained by reduction from the 
powers c, (c)*, (¢)3, +++ and that the irreducible constituents 
of (c)/ are obtained by imposing certain symmetry conditions. 

We must accordingly generalize the problem of the linear 
theory of invariants in the following manner. Consider two 
unitary representations Ĥ:o—> s, §:a—S of the abstract 
group g with elements ø; let their dimensionalities be n, N 
and let ý) be irreducible. We wish to determine all covariant 
quantities of kind h in the representation space of . Calling the 
variables in this representation space x;, which undergo the 
transformation S under the influence of o, such a quantity 
I has n components J,, Ia © * +, J, which are linearly independent 
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linear forms in the variables x; When the Xi undergo the trans- 
formation S the n linear forms J, go over into new ones which 
are obtained from the J, (in which the variables x; have been 
transformed in accordance with S) by means of the transforma- 
tion s of h. If there exist two or more covariant quantities 


I= (qh, Le oe La); r= (i, Te moe, I’), ar te 


of the kind in the representation space of , then any linear 
combination of + «’J’ +--+ with constant coefficients « is 
again a quantity of the same kind. We ask for the number m 
of linearly independent quantities of this kind. The answer 1s 
that m is equal to the number of times the irreducible representation 
h is contained in H. Hence if x, X are the characters of h, 9, 
we have 
m = M{X(s)x(s)}. (14.2) 
In order to prove this statement we choose the co-ordinate 
system x; in the representation space of Q in such a way that 
the matrices of H are reduced into their irreducible constituent 
sub-matrices, the m representations h: h’ = H” = 
being separated out first. The remaining constituents (+), 
+++ are inequivalent to h. Denote the variables in the corre- 
sponding invariant sub-spaces by 


eee — Hm) — 


Bi as Gag. ge BOD eel Ss aoe 
The matrix S is completely reduced into the sub-matrices 
S = 5,7++, S™ = s; simtl)--- arranged along the principal 
diagonal. Let 


Yi = Ay Xt: ae 


Yn = Any %y +t + Any ty 
be a covariant quantity of the kind §. We can write this in the 
form y = Ax in terms of the column x of the N variables x; 
the column y of the n variables y« and the matrix A = ||a,; |. 
The requirement that J be a quantity of kind h means that 
when x is replaced by x’ = Sx, y goes over into y’ = sy, or 


sy = ASx, sAx = ASx, sA = AS. (14.3) 


Corresponding to the reduction of x#-space into irreducible 
sub-spaces, the matrix A of the correspondence of x-space on 
y-space is reduced into matrices 4’, +++, Alm), Amin . 
consisting of the first n rows, «++, the m' set of n rows, 
-+: of A. Equation (14.8) then becomes 


sA’ = A'S, e 8 z sAm = Ams sAM' 1) — Amt slmt) * e à 


e» 6 
? 


pres s- 
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It follows from the fundamental theorem (10.5) on representa- 
tions that A’, - +--+, A are all multiples of the n-dimensional 
unit matrix and that the remaining Amt + + are all zero. 
But this is just our assertion that Y= (Yn Ya °°, Ya) is a 
linear combination of the m quantities 


x = (xi, 26 ae x), 


O C E EE 
of the kind }. 


§ 15. Remarks on Lie’s Theory of Continuous Groups 
of Transformations 


In § 12 we made use of the concept of infinitesimal elements 
of a group in order to establish a method of measuring volume 
on a continuous group manifold. We here discuss this concept 
in detail for the 3-dimensional group D of rotations in Euclidean 
space. This group serves to describe the mobility of a body 
in Euclidean space, one point O of which is fixed in space. Each 
possible position of the body can be considered as arising from 
any given initial position by an operation of b. A material 
substance distributed throughout the space or any portion of 
it moves as a rigid body about O if the position of each of its 
elements at a given moment is associated with its initial position 
by means of a correspondence belonging to b. This is the 
description of the motion of such a rigid body which compares 
the position in any moment directly with the initial position, 
ignoring the intermediate states which it has assumed in going 
from the one into the other. But it seems more natural to 
consider it in terms of a continuous motion in which the position 
of the body undergoes an infinitesimal rotation from moment 
to moment, so that the motion as a whole is the integration 
of a series of infinitesimal operations of b. On employing an 
auxiliary variable ¢ in order to avoid the use of infinitesimals 
and thinking of this parameter as time, the velocity field 
dx = 2%, dy = j, dz = 2 of an infinitesimal rotation is defined 
by [cf. I, § 6] | 


ax = bz — cy, dy = cx — az, dz= ay — bx, (15.1) 


where the constants a, b, c are independent of position (x, y, z). 
These velocity fields, which obviously constitute a 3-dimensional 
linear manifold, are the infinitesimal elements of d; they are 
the “vectors ’’ which define the linear space tangent to the group 
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manifold at the point which represents the unit element I. 
The continuous motion of a rigid body about O is characterized 
by the fact that at each moment its velocity field belongs to 
the 3-parameter linear family (15.1). We may take as a basis 
of this family the three elements D,, D,, Dz obtained by choosing 


a=1b=0,c=0;a=0,b=1,¢=0; 4=0,6=0,c=1. 


We call these ‘‘ the infinitesimal rotations about the x-, y- and 
geaxes.” S. Lie was the first to undertake a systematic study 
of the construction of transformation groups from their in- 
finitesimal elements., In fact, once they are known all the 
substitutions of the continuous group can be generated by 
integration, i.e. by successive application of such infinitesimal 
elements—at least, all those which belong to the same connected 
“sheet” as the identity. (Example: the proper orthogonal 
transformations can be obtained from the infinitesimal ones, 
but not the improper transformations with determinant — 1). 

In general, consider a continuous 7-parameter transformation 
group ©, and let the group manifold be described in terms of 
the parameters $i, Sy, ** *, Sy in the neighbourhood of the unit 
point, at which they vanish. A portion of the group manifold 
is thereby mapped in a one-to-one continuous manner on a 
neighbourhood of the origin in the r-dimensional number space 
of the parameters s. Let the n-dimensional point-field of the 
transformations be described in terms of co-ordinates %4, Xz, °° *, Xn 
in the neighbourhood of the point under consideration, and let 
the correspondence x > x’: 


X; = il£ Xe e e Engi Sx, °° 4) S,) 


be associated with the element (Sı, Sa °° *, S+) of the abstract 
group in its realization by the transformation group. The 
infinitesimal transformation x —> x + dx obtained by assigning 
the infinitesimal increments ds to the parameters s in the neigh- 
bourhood of s = 0 is given by 


da; = (as tot (Las; (15.2) 


the parentheses indicate that the differential quotients are to 
be computed for s, = 0,-- +, s,=0. We postulate a material 
substance which fills the point-field and which is capable of 
executing those and only those motions in which the positions 
of its elements at an arbitrary moment ¢' are obtained from their 
positions at time ¢ by a transformation of ©. Again its motion 
can be more simply described as the result of successive deforma- 


Breiman eee we zis 
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tions corresponding to infinitesimal operations (15.2) of our 
group; the velocity field must at any time have the form 


b; = ($ ope Gate (15.3) 


where o, * * *, @y are constants independent of position. This 
y-dimensional linear family constitutes the infinitesimal group of 
motions of our substance. It is to be observed that the application 
of these infinitesimal processes to our transformation group 
presupposes that the functions ¢, are differentiable with respect 
to sat the point s=0. In the theory of abstract groups the 
point-field is the group manifold itself and we take as a realization 
(left-)translation. In the neighbourhood of the unit element 
s = 0, t = 0 we have, as law of composition, 


(Ste = Palsi =. 8 Sr; ty e ee t,) [a = t, . , r]. 


The introduction of a measure of volume in § 12 presupposes 
that the functions %, are, for sufficiently small ż, differentiable 
with respect to the s at the point s = 0, and that for sufficiently 
small s they are differentiable with respect to ¢ at t = 0. 

The composition of infinitesimal elements of the group is 
expressed by addition of the parameters o introduced by (15.3). 
It might therefore appear as if the infinitesimal elements of an 
y-parameter continuous group need satisfy no condition other 
than that they constitute a linear family. However, that is 
not the case; there are further “ integrability condttions’’ to 
be satisfied. The example of a sphere which rolls without 
slipping on a horizontal table shows that the possible positions 
of a body whose infinitesimal motions have but three degrees 
of freedom can nevertheless constitute a 5-dimensional manifold. 
The integrability conditions we are seeking, which involve 
second order derivatives, guarantee that this situation does not 
arise. We obtain these conditions on expressing the fact that 
the commutator sts} of two infinitesimal elements s, ¢ of the 
group also is an element of the group. This commutator con- 
verges to [| as s approaches the unit element I, whatever ¢ may 
‘be, and similarly as ¢-» | for arbitrary s. The commutator of 
the two infinitesimal linear correspondences A and B: 


dx = Ax, d'x = Bx 


is the infinitesimal correspondence AB — BA; to show this 
we note that the equation 


A(s)B() = Ps, BOA() 
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leads, on writing 5 
dA 

A(0) =1, BO) = 1, (F) = 4, a = B, 

I(s,t) —1 a eS 


lim ds r E : 


8,t—>0 S:t 
to the equation 


C = AB — BA. 


Our main purpose in mentioning these matters is to prepare 
the ground for an understanding from general principles of the 
commutation rules satisfied by the three infinitesimal rotations 
D,, Dy, Dz: 


00 0 0 0 1 0 —1 0 
0 0 —I1), 0 0 Ol, WI O oll. (15.4) 
0 1l 0 —1 0 0 0 0 0 
They are, as is readily shown, 
D, Dy — DD: = D., DD: — D:D; = "a 
D,D, — Da D: = Dy. (19:0) 


We could, of course, take the unimodular unitary group Us 
in two dimensions as fundamental, instead of the group bd, of 
rotations. We denote the two variables which undergo the 
transformations o of the unitary group by €, ņ asin §8. In 
consequence of the correspondence o > s, which was established 
there by means of a stereographic projection, the 3-dimensional 
rotation group now appears as a representation of us. We can 
take as a basis for the 3-parameter linear manifold of infinitesimal 
operators of u, the three particular operators— 


l 1 1 
i . dë — gih dn = ris 
1 l l 
57 Sv: d = — 3 dy = 5 fi (15.6) 
Le — 1 _ l ae 
z7 92: ag = 57 & dyn = — 571; 
here, in agreement with (8.15), 
0 1 0 -1 
Ss = | ’ y =” ; ) ees : : . 
1 0 0 0 —l 


They are the infinitesimal transformations of u, corresponding 
to the three infinitesimal transformations D,, D, D, of d, in 


LIES THEORY OF CONTINUOUS GROUPS 179 


virtue of the correspondence o —> s ; that this is in fact the case 
is readily seen from (8.10) or 


l 
x ~ — E H ne, A z ~ Én. 


Given any representation § :ø-> U(o) of up, its infinitesimal 
operators with matrices 


l 
t (Mo, My, M,) 

corresponding to the infinitesimal operators (15.6) in wu, 
satisfy the same equations (15.5) as the D,, Dy, D3: 


M,M,— M, Mo = iMn. (15.7) 


The matrices Mz, My, M, are of course Hermitian. For reasons 
which will appear in the following chapter we call these the 


components of moment of momentum (or angular momentum) M 
of the representation Ñ, and 


M = M? == Mz + My + M? 


the square of the magnitude of the moment of momentum. If 
H, Q are two representations with angular momenta M, Mt’ 
‘then, in accordance with the general formula II, (10.4), which 
governs the composition of infinitesimal operators by X -multi- 
plication, the representation © x ’ has as moment of momentum 
(M x 1) + (1 x MY). 

We next calculate the moment of momentum WM, of the 
irreducible representation ©; = D; (j = f/2) of u,. It will be 
1 


found more convenient to employ in place of ay Fil the 
transformations | 
l i 
5 (Sz + 1S, ): dE =n, dyn =0 
1 (15.8) 
z (Sa ty) : dé = 0, dy = £ 
In general 


dlé’) — 7 ér- x! dé +s ér ni} dn, 
and on substituting in this the variables 


Ern’ 94 9 
iad ana r ers, (r+ s= 2j, r— s= 2m) 
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i he three infinitesi- 
of the representation space of D,, we find that t nites! 
mal transformations of u, defined by (15.6), (15.8) induce in this 
space the transformations 


5 (Ss + iS,) : dx(m) = Vr(s + 1) x(m — 1) 

= VG F mG — m F T) x(m — 1), 
z (S+ — iS,): dx(m) = Vs(r + 1) x(m + 1) 

= VG— m] +m +1) x(m + 1), 


— $ 


Sz: dx(m) = A 5 


x(m) = m x(m). 


Hence 
(M, + iM,)(m,m—l)=VG+mG—m+)), 
(M, —iM,)(m,m+1)=VG—m(j+m+1),} (15.9) 
M,(m, m) = m. 
All other components (m, m’) vanish. M? is a multiple of the 
unit matrix in ft; : 7 
2 = 99 + l), 
for it follows from 
(Ma + 1M,)(M, — iM,) = M} + M? — i(M,M, — M,M,) 
a M; + M? ++ M, 
that . 
M? = (M, + iM,)(M, —iM,) —M,+ M2, 
and from this and (15.9) that 
M?(m, m) = (j + m)(j — m + 1) — m + m = j(f + 1). 
If on reducing an arbitrary representation the irreducible 


representation D; is found to occur exactly g, times, then M? 


has j(7-+1) as a [(27 + 1)g,]-fold characteristic number and 
M, has the characteristic number m with multiplicity 


“8s (7 = |m|, [m] +1, - - +). 


From this we again see that the multiplicity g; with which 9, 
occurs in the reduction of © is uniquely determined by §. 
These infinitesimal operations can be used to give a relatively 
elementary constructive proof of the fact that the D; are the only 


irreducible representation of u,.!2 
§ 16. Representation by Rotations of Ray Space 


In quantum theory the representations take place in system 
space ; but this is to be considered as a ray rather than a vector 
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space, for a pure state is represented by a ray rather than a 
vector. Two unitary transformations U and €U which differ 
only by a numerical factor € of absolute magnitude 1 are con- 
sequently to be considered as the same, U eU, for they 
determine the same rotation of the ray field. Ina“ ray repre- 
sentation,” which associates with each element s of the abstract 
group g a unitary rotation U(s) of the rays of n-dimensional 
representation space, the gauge factor e(s) may be taken 
arbitrarily for each unitary matrix U(s); if g is a continuous 
group we choose it, however, in such a way that U(s) depends 
continuously on s. The condition for a representation is now 
only 
} U(s)U(t) œ U(st), (16.1) 
Le: 

U(s)U(t) = &(s, U (sò, (16.2) 


where ô(s, #) is a numerical factor, of modulus 1, depending on 
s andi. If by change of gauge U(s) is replaced by e(s)U(s), 
5(s, t) is replaced by 
e(st)e1(s)e(£)8(s, t). 
In the equation 
X = J'x(s)U(s), 


defining the connection between the components x(s) of an 
element x of the algebra of the group and the group matrix X 
which represents it, the x(s) are also dependent on the gauge 
and are sent into e(s)x(s) on the change of gauge defined by 
U(s)—>e(s)U (s). In order that the multiplication law for two 
elements x, y shall, as we require, parallel the multiplication of 
the matrices which represent them we must define 


zyl) = Et, t) (A yt) (16.3) 


in terms of the chosen gauge. The condition 
Z(s7!) = x(s) 


for a real element x is only appropriate if the gauge is so chosen 
that U(s7?}) is the matrix reciprocal to U(s). The algebra of 
the group is to be adapted in this way to the ray representation 
under consideration, whereas in dealing with “ vector repre- 
sentations ’’ it is uniquely determined by the law of composition 
of the group alone. 


Examples. 


I. The 1-dimensional representations are now entirely 
uninteresting, for any l-dimensional matrix~1. But under 
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certain circumstances Abelian groups may possess multi-dimen- 
sional unitary ray representations, whereas any irreducible 
unitary vector representation of an Abelian group is necessarily 
of degree 1. 

We first investigate the simplest example, a finite cyclical 
group (a) of order h, consisting of the elements 


l, a, a’, >» @ A gh} (a zo I). 


Let the element æ correspond to the unitary matrix A in the 
ray representation; then A* = «4 is necessarily a multiple of 
the unit matrix. Since « is of modulus 1 we may change the 
gauge in such a way that A goes into AlV«; then 4* = 1 
and the correspondence a*-» A is a vector representation of 
the cyclical group. Hence by introducing an appropriate 
change of gauge the ray representation can be made into a 
vector representation, 8(s, t) being then 1. 

II. The simplest example of an Abelian group which gives 
rise to multi-dimensional irreducible ray representations must 
consequently be non-cyclic. Consider the group consisting of 


the four elements I, a, b, c with the multiplication table 
= b= f=], 
prica a ees T 


A ray representation % is given by 


S 
| 


10 
~ Ho 1 


Hepa 


Ula) =|) o 


nb 


The normalization is here chosen in such a way that 

Ua) = U(a)U(a-) = 1 
and similarly for I, b, c. The algebra defined by (16.3) for. this 
representation is non-commutative in spite of the Abelian 
nature of the group; it is the algebra of complex quaternions. 
On denoting the elements of this algebra by 

X=xl+Aa+pb+ve, 
the “units” 1, a, b, c have the same multiplication table as 
the corresponding matrices U: 
1 a b c 


a 2 © (The product xy occupies 
l 1c — ib the intersection of the 
row x with the column y.) 


l 

a 

b —ic | 1a 
n 
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The “real” quantities are those for which all components 
K, A, p, v are real. Since in the calculus of quaternions I, ta 
ib, ic are taken as the fundamental units, they are those whose 
scalar component x is real and whose vectorial components 
Aji, pli, v/t are purely imaginary. 

. HI The group U = Uy, of unilary transformations o in two 
dimensions with determinant 1. Consider a representation 
o —> U(a) by rotations in n-dimensional ray space. On changing 
the gauge in such a way that U(o) goes into 


U(c): Wdet U(c), (16.6) 


the determinant of the new U(o) is 1. The only possible diffi- 
culty consists in the fact that the n™ root 


e(o) = Vdet Uo) (16.7) 


is multiple-valued. It is ‘“‘ locally” single-valued, i.e. if we 


have chosen a definite one €, of the n values for the point 


o == 0», we can uniquely determine the root e(o) in a sufficiently 
small neighbourhood of c in such a way that it depends con- 
tinuously on o and goes over into € foro =o». Hence we can 
continue the determination of the root for e = o» in a unique 
manner along a path in the group manifold, starting in co- 
The only question is whether e(ø) returns to its original value 
when we allow o to describe a closed path. This is to be answered 
in the affirmative, since the group manifold of u is simply connected 
in the sense that any closed curve can be drawn together into 
a point by a continuous deformation. For in accordance with 
equation (7.5) the elements of the group are mapped in a one- 
to-one continuous manner on the quadruple («À uv) of real numbers 
which are subject to the condition 


KRA ape ap Wade 

Hence the group manifold has the same topological properties 
as a 3-dimensional sphere in 4-dimensional space. These con- 
siderations thus show that the nt root (16.7) is broken up into 
n single-valued continuous ‘functions over the entire group 
manifold. The method of proof here employed, which is of 
fundamental importance in the whole of mathematics, 1s perhaps 
best known to the reader in the proof of Cauchy's integral 
theorem ; it follows from the fact that the integral of an analytic 
function is locally single-valued, that it is single-valued in the 
large if the region in which we are operating is simply connected. 

The result of our topological considerations showed that 
the formula (16.6) defines n single-valued continuous functions 
U(c). One of them is such that in it U(I) is the unit matrix ; 
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we henceforth denote it alone by U(c). On writing the equation 
U(a) U(r) = 8(c, 7) U(or) (16.8) 


for r= |Ì, and taking into account the fact that U(I) = 1, we 
find 6(s, 1) = 1. On forming the determinant of both sides of 
(16.8) we obtain the equation 


1 = [8(0, 7)]*. 


6(o, 7) is consequently an n'è root of unity which depends con- 
tinuously on 7 for fixed o and which reduces to 1 for r= l; 
hence it is identically equal to 1, and (16-8) becomes 


U(o) U(r) = U(øor). 


Consequently the only ray representations of u, are also vector 
representations, and our considerations show that this theorem is 
valid for any continuous group whose elements constitute a simply 
connected manifold. On going over to the 3-dimensional rotation 
group Ds by stereographic projection, all D, even those with 
half-integral 7, are single-valued when considered as ray repre- 
sentations. Any single-valued continuous ray representation of 
Dg is reducible into irreducible constituents, and the only irre- 
ducible ray representations are the D; (j = 0, 1/2, 1, 3/2, += -) 
obtained earlier in the chapter. But b; is not simply connected : 
we must resort to a two-sheeted covering surface, similar to 
a Riemannian surface but without cuts or branch points, which 
is simply connected. This accounts for the fact that there 
exist irreducible ray representations of b, which may be single- 
or double-valued vector representations, but there cannot exist 
multiple-valued representations of higher degree. 

I have been able to prove the same theorem for the n-dimen- 
sional rotation group (n = 3). This means that there exist 
two closed continuous motions (ie. motions which lead back 
to the initial state) of a rigid body, which is free to rotate about 
a fixed point 0, such that any other closed motion can be con- 
tinuously deformed into one of the two. One of these may be 


taken as vest, and the other is such that it cannot be continuously 
deformed into rest. 
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CHAPTER IV 


APPLICATION OF THE THEORY OF GROUPS 
TO QUANTUM MECHANICS 


A. The Rotation Group 


§1. The Representation Induced in System Space by 
the Rotation Group 


N accordance with III, § 8, we can interpret the theory of 

a single electron in a spherically symmetric electrostatic field, 

as developed in II, § 5, in the following manner. A rotation 
of physical space, i.e. an orthogonal transformation from the 
Cartesian co-ordinates xyz into x’y’s’, induces a unitary trans- 
formation U(s) : Y% —> w’ defined by 


pr (x'y'2') = plxye) (1.1) 
in the system-space % of the electron, the vectors of which are 
the wave functions (xyz) describing the state of the electron. 
The correspondence s-»> U(s) is a definite representation ©, of 
infinitely many dimensions, of the rotation group b, This 
representation © can be reduced into its irreducible constituents 
D, and it is found that each D, with integral / occurs an infinite 
number of times. The total system-space ft is correspondingly 
decomposed into mutually orthogonal sub-spaces R(nl) ; R(x) 
has 2/-+ 1 dimensions and the rotation group induces the 
representation ®, in it. If we introduce in addition the im- 
proper rotations (03) D, always appears in © with the signature 
(— 1). The oco-dimensional sub-spaces X R(nl) associated with 


the various values of l are uniquely determined, but their further 
decomposition into the summands R(n?) is quite arbitrary. In 
particular, this can be done in such a way that the energy of 
the states composing R(n?) has a definite value E(n/). 

We now calculate the operators induced in system-space 
by the infinitesimal rotations of physical space. Denoting the 
increase w’(xyz) — (xyz) by db, equation (1.1) becomes 

oY a Oe, 1 OY ) = 
dys + (Zax +o + syd) = 0 
185 
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for the infinitesimal rotation s which sends 
x, y, z into xz’ =x + dx, y =y + dy, 2 = z + dz. 


Taking as s the three infinitesimal rotations D,, Dy, D, in turn 
[III, (15.4)] and writing the corresponding infinitesimal unitary 
operators in the form 


1 
dys = (Le, Ly; La), 
we find 
1/ d ð 
e ayy a ne 1.2 
La= Gr 5) a 


hQ is accordingly the moment of momentum [cf. II, (4.9)]. 

On going over from one electron to two, the vectors of system 
space are the functions Ą$(x1y131; %2Ye%) of the Cartesian co- 
ordinates of both electrons. The unitary transformation 
U:y—>w’ induced in system-space by the rotation s 1s now 
defined by the equation 


AEREA ; AEA) = P(Y; LY) 


where x19123, and %33% are obtained from %1yı2ı and %ay,%ı 
by the same orthogonal transformation s. This situation can 
be described as follows: The state space R? of the system con- 
sisting of two electrons is R x R and the representation 7 
induced in it is © x &. 

This representation is, as we see, determined by the kine- 
matical constitution of the system alone, and is in no way 
influenced by the dynamical relationships; the rule for x- 
multiplication for the induced representation on composition 
of partial systems presupposes only kinematical, not dynamical, 
independence of the partial systems. 

We can, without further trouble, formulate the situation 
discussed above in terms of the gencral scheme of quantum 
mechanics in a manner which is independent of the particular 
assumptions of Schrédinger's scalar wave theory. This is all 
the more important since it has all along seemed doubtful 
whether the matter waves could be described in terms of a 
single state function %. We set up an analogy between the actual 
displacement of the state of the system in time and the virtual 
change produced by an arbitrary rotation of space. The 
transition from time £ to time ¢’ changes the (arbitrary) state 
č at time ¢ into a state 7’ at time t’, obtained from g by a unitary 
transformation U corresponding to a displacement of the time 
axis which sends ¢ over into #. The displacements along the 
time axis constitute a one-parameter continuous group which is 
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isomorphic with the group of transformations U associated 
with them in system-space. The former group is generated 
from the infinitesimal displacement t —> t + dt, and it therefore 
suffices to give the infinitesimal unitary operator 
dt 
dg = + He 

associated with it in system-space. We called the Hermitian 
operator H the energy. 

On subjecting the physical system (or the spatial co-ordinate 
system in terms of which it is described) to a virtual rotation s, 
the state çp goes over into another state x’. Since nothing 
intrinsic to the system is changed thereby and since the state 
space R is linear and unitary, the transition U(s):¢—> r 
associated with s must also be linear and unitary. As in the 
case of the group of actual displacements in time, this group 
of virtual rotations in space must induce a certain representation 
N in the system-space R; this latter is more properly to be 
considered as a ray, rather than a vector, space. But if we go 
over from the rotation group to the unimodular unitary group 
u (or us) by stereographic projection (III, § 8) and take this 
latter as fundamental, it is, in accordance with III, § 16, not 
necessary to distinguish between ray and vector representations. 
The group of proper rotations can be generated from its infini- 
tesimal operations, and we may take as a basis for these the 
infinitesimal rotations Ds, Dy, D: about the x-, y-, and z-axis. 
It then suffices to know the infinitesimal unitary transformations 


l 
dy = (Ma, My, M,)t 


which they induce in system space. We call the real physical 
quantities of the system which are represented by the Hermitian 
operators M,, M, M, the x-, y-, g-components of the moment 
of momentum St. In order to express them in terms of the 
usual units they must, as was also the case with the energy, 
be multiplied by the quantum of action h. The moment of 
momentum plays the same réle with respect to the virtual rotations 
of space as the energy with respect to the actual displacements in 
time. 

One argument for the appropriateness of our definition of mo- 
ment of momentum is that in the case of the Schrödinger theory 
it leads to the usual formule of classical mechanics. As a further 
justification we prove the general theorem that the moment of 
momentum so defined is constant in time. We saw in II, § 8, 
that the necessary and sufficient condition that the physical 
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quantity represented by the Hermitian operator A be constant 
in time was that A commute with the Hermitian operator H 
induced by the infinitesimal displacement of time. In exactly 
the same way we can show that the commutativity of A with 
M,, M, M, constitutes the necessary and sufficient condition 
that the quantity represented by A remains unaltered under the 
virtual proper rotations of space, i.e. that A is a scalar with 
respect to these rotations. Now the energy is a scalar, hence 


HM, —M,H=0, >>. 


But, on the other hand, these equations assert that M,, My, M: 
are constant in time. 

The infinitesimal rotations generate only the group of proper: 
rotations ; in order to obtain the complete orthogonal group we 
must supplement them with the reflection 7 in the origin, or 
extend the group wu, to the group u by the addition of the ele- 
ment « (III, § 8). « will induce a unitary operator J in system 
space which commutes with all U(s), in particular with the 
moment of momentum Jt = (M,, My, M,), and which satisfies 
the equation JJ = 1; this shows that J is Hermitian, as well 
as unitary. A quantity A which is unchanged by reflection 
must commute with J; hence, in particular, the energy H 
must commute with J. The physical quantity represented by J, 
which we call the signature, is constant in time, as it commutes 
with H. It has, in common with all quantities arising in group 
theory which are not associated with infinitesimal operators, 
no analogue in classical mechanics. 

We reduce the total system-space into invariant sub-spaces 
with respect to the group of displacements in time; such an 
invariant sub-space is carried over into itself by the generating 


infinitesimal operation dy = He. Since we are here dealing 


with a one-parameter Abelian group, or with a single operator H, 
this reduction can be carried to the point in which all the con- 
stituent sub-spaces are l-dimensional. The states contained in 
one of these invariant sub-spaces we call quantum states. 

We now proceed in exactly the same manner to reduce the 
representation N induced in system-space by the group of rota- 
tions into its irreducible constituents D,;. We make use of the 
fact that these are known to us a priori; only the number of 
times they appear in Jt depends on the particular representation 
N. (Of course, we have not as yet shown that the Ð, really 
constitute a complete system of irreducible representations of 
Ds, and it may seem risky to apply the process of reduction to 
the oo-dimensional representation 9%. This procedure can, 
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however, be justified on the basis of the fact that Ð, is a closed 
group. But in the final formulation of quantum mechanics it 
will not be necessary to base our conclusions on such general 
considerations, as the reduction into ®, will be obtained by 
elementary means.) The entire system-space R is thus decom- 
posed into sub-spaces R, Rp, ++ + such that R; is of dimension- 
ality 27 + 1 and the representation induced in it by the group 
u, is D, On adapting the co-ordinate system in system-space 
to this decomposition the variables fall into classes 


x(m) (m = 7, j=l, PA E — j); ER , 
x'(m') m eg Sra ape ee 


) 


under the influence of an arbitrary transformation o of Uk, 


applied to the variables £, 7 the co-ordinates of system-space 
transform in accordance with the law 


imk 
zim) ~ EL (i + k = 2j, i — k = 2m). 


With the reduction of R or K is associated the reduction of the 
angular momentum Mt; in the sub-space R; the components 
of M are given by II, (15.9), from which it follows that the 
square M? of the moment of momentum has there the fixed 
value 7/7 +1). (It is evident from general considerations 
that M? must be a multiple of the unit matrix in R,, for it is 
a scalar and must therefore commute with all the operators of 
the irreducible representation %,.) If the state of the system is 
represented by a vector lying in Ñ; the z-component of its 
moment of momentum is capable of assuming the values m = J, 
j—1,--+:+, —7; the z-component naturally only apparently 
occupies a preferred status, due to the fact that the co-ordinates 
in R, were chosen in a manner which differentiated the z-axes 
from the others. That M, M? can a priori assume only discrete 
values m, j(7 + 1) is essentially due to the fact that the rotation 
group is closed; since the group of displacements in time is open, 
the analogous result for the energy need not in general hold. 
In this connection we wish to emphasize again that the operator 
H depends on the dynamical relationships existing in the system,. 
whereas the representation Jt induced by the group of rotations 
is determined only by the kinematical situation (number of 
elementary particles, etc.). The signature / also assumes a 
definite one of its values +1 in each sub-space R;. For lack 
of a better name we call the states which lie in the sub- 
space R, which is invariant under the group of rotations, 
“simple” states of inner quantum number j. We must 
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be prepared to find that 7 may here assume half-integral as well 
as integral values, in contrast with the Schrödinger theory. l 
On uniting two kinematically independent systems, with 
system-spaces R, R’ in which the rotation group induces the 
representations R, W, the total system has as system-space 
R >< R, in which the representation N x M’ is induced. In 
particular, the moment of momentum of the total system is 


(M x 1) + (1x M) 


where W and M' are the angular momenta of the two partial 
systems. The theorem that ihe moment of momentum behaves 
additively with respect to composition is contingent only on the 
assumption that the parts are kinematically independent, 
whereas the corresponding theorem for energy applies only if 
they are dynamically independent, i.e. in the absence of inter- 
action between the parts. This difference is based on the fact 
that whereas the energy represents that actual change of state 
in the course of time, the moment of momentum represents 
the virtual change associated with a fictitious rotation. We 
reduce R, R into the invariant irreducible sub-spaces 9, R; 
respectively, i.e into the simple states of the two partial systems 
having inner quantum numbers, j, 7. The Clebsch-Gordan 
equation (III, § 5) 


D; X Dy = Djaz +Dato Diza (1.3) 


then tells us: If the two parts are in the simple states with inner 
quantum numbers j, j then the whole has each of the simple states 
with inner quantum number 


J=i+ř, G4+7-1,-°+ [7-7 (1.4) 


associated with it, each exactly once. To include the signature 
we must add: Jf the parts have as signatures the values 8, 8’ 
(è = + 1), the signature of the whole has the value 35’. 

Compare the results which we have obtained with the 
corresponding results in classical mechanics. In both the moment 
of momentum is constant in time and the moment of momentum 
of the whole is equal to the sum of the moments of momentum 
of the two parts. Denoting the magnitude of the moment of 


momentum in classical theory by j, we have, in agreement with 
(1.4), : 


r 
s’ 


j—7| SJ Si+7, 


for the resultant of two vectors of magnitudes j, 7’ is a vector 
whose magnitude J lies within these limits. Quantum mechanics 
deviates from classical mechanics in the following three respects : 
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1. In quantum mechanics the square of th 
is j(j + 1), im classical Ae is Hs G CAE igrizat 

2. Here j can assume only the discrete values 0, 4, 1, 3,- -> 
there 1t may have any non-negative value ; nie Oe 

3. Here the | obtained on compounding two partial systems 
can assume only those values between |j — 3|, G+ 7 which differ 
from them by an integer, there it can assume any value between these 
limitis. 

Already before the rise of the new quantum mechanics a 
semi-empirical description of the regularities observed in spectra 
had been given with the aid of a vector model consisting of the 
vectorial moments of momentum of the individual electrons 
and df the atom as a whole; the observations, assisted by the 
older quantum mechanics, had already led to these three modi- 
fications of classical theory.? 

The reader will perhaps have wondered why we consider 
only the virtual rotations of space and not the translations 
which must also be taken into account in order to arrive at a 
complete description of the homogeneity of space. The reason 
for this is that in studying atoms or ions we treat only the 
electrons as particles, taking the nucleus as a fixed centre of 
force situated in the origin. That this is at least approximately 
correct is due to the fact that the mass of the nucleus is many 
times the mass of the electrons. Space is thereby transformed 
from a homogeneous into a centred space; such a procedure 
naturally allows us to consider only atoms or ions, which have 
a single nucleus. Diatomic molecules are accordingly described 
with the aid of the 1l-parameter group of rotations about the 
axis joining the two nuclei, and not by the full 3-parameter 
group of rotations of space—to this we must add reflection in 
the plane which bisects the axis perpendicularly in case the two 
nuclei are physically equivalent.” If we are dealing with three 
or more fixed nuclei the symmetry either disappears entirely or 
is reduced to at most a finite group of rotations.° 


§ 2. Simple States and Term Analysis. Examples 


To each characteristic value E’ of the energy H there belongs 
a definite sub-space R of R, the sub-space of quantum states 
with energy level Æ’ ; it consists of all states g which are trans- 
formed into E’-¢ by the operator H and is accordingly the 
characteristic space R(E’) associated with the characteristic 
value E’ of H. Since the energy is a scalar, the considerations 


applied in the preceding paragraph to the total space R can also 
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be applied to R’: R is invariant under the operators induced 
in system-space by the rotation group and is consequently the 
carrier of a certain representation of this group, which can be 
reduced into its irreducible constituents. If the energy levels 
are of at most finite multiplicity we are faced with the problem 
of reducing only representations of finite degree. Accordingly 
R is decomposed into the ‘‘ simple spaces ” R, associated with 
the rotation group in such a way that not only the square of 
the angular momentum and the signature have definite values 
in tj, but also the energy has a sharply defined value £;. This 
energy level E, is necessarily (27 + 1)-fold degenerate; we 
speak of an accidental degeneracy when the energy levels of 
different simple sub-spaces R; are equal. J, M, M? and H 
are all simultaneously in diagonal form; that this is possible 
is due to the fact that these four operators all commute among 
themselves. In this way the reduction into simple states can be 
employed in term analysis: each energy level E, possesses an 
inner quantum number j which gives the term the natural 
multiplicity 27 + 1. 

On subjecting the atom to a perturbing field which destroys 
its natural spherical symmetry this (27 + 1)-fold term is broken 
up into 27 + 1 terms. Let the perturbation, i.e. its Hamiltonian 
function W, possess axial symmetry about the z-axis; if Æ; 
possesses no accidental degeneracy, then in accordance with the 
theory of perturbations the perturbed energy levels are given to 
a first approximation by the portion of the Hermitian operator 
W in which R; intersects itself : 


x(m) > LW(m, m’) x(m’) (m =j, j— 1, +++, — 3). 


The rotation about the z-axis with meridian angle ¢ transforms 
x(m) into e(— m@) - x(m), and in virtue of the symmetry assumed 
for W this correspondence of R; on itself must also be represented 
by 

e(— md) > x(m) = XW (m, m’) + e(— m'h) x(m’), 
or 


W(m, m’) e[(m — m’)d] = W(m, m’). 


But this means that all elements W(m, m’) except those in the 
main diagonal vanish, whence 


E; + W(m, m) (2.1) 


are the 2j + 1 perturbed terms. The quantum number m, 
which is capable of assuming the values j, j7—1,--°', —J, 
thus serves to label these components. Perhaps the most 
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important axially symmetric perturbation is that due to a 
homogeneous magnetic field in the direction of the z-axis 
(Zeeman effect); because of this m is called the magnetic 
quantum number. The inner quantum number j of a term 
can be determined spectroscopically by counting the number of 
terms appearing in the Zeeman effect. Sommerfeld first con- 
cluded, from the spectroscopic data, that 7 as well as m must be 
allowed to assume half-integral values. If we consider the 
Zeeman effect to be described by the analogue of the classical 
formula II, (12.5) then 


_ oh. _ _ €|9| 
W = Jue (HM) = hoM,, o = ue" (2.2) 


and W is rigorously in diagonal form : 
W(m, m) = hom. (2.3) 


Our analysis shows that the breaking up of energy levels due 
to an axially symmetric perturbation parallels the reduction of 
an irreducible representation of the rotation group 0; when this 
is restricted to the group D, of rotations about the z-axis: by 
this Ð; is reduced into the 27 -+ 1 one-dimensional representations 
which we have previously denoted by D™ : 


x(m) —> e(— mẹ) < x(m). 


If two kinematically independent parts, which are in the 
simple states R, Ry, are compounded together, the state of 
the composite system is in the (27 + 1)(27 + 1)-dimensional 
product space Rj = R; X Ry. If the parts have the energies 
E, E, then the whole has the energy E; + Ep, assuming no 
interaction between the parts. Introducing a weak interaction 
between the two partial systems and assuming that there is no 
accidental degeneracy, i.e. assuming that all the remaining 
energy levels of the unperturbed system are different from E;y, 
it suffices, to a first approximation, to consider the section 
<H)> of the energy operator H in which R; intersects itself ; 
it is an Hermitian correspondence of R, on itself. We can 
apply the considerations, which were applied above to the total 
system-space R x R, to each of these Rip: R; is to be de- 
composed into sub-spaces belonging to numerically distinct 
characteristic values of <H>. The rotation group induces a 
certain representation in each of these sub-spaces, and this 
can be further decomposed into its irreducible constituents. 
The result is that R; x Wj is, in accordance with the Clebsch- 
Gordan series, reduced into the simple spaces Ru, J =j +7’, 
jty—-l,-c, i—i, in such a way that in each of them 
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the energy <H> has a definite value Ey. Different Ey can only 
“ accidentally ” have the same numerical value. Consequently 
the term E,, is broken up by the perturbation into terms Ey 
in exactly the same way as the representation D x Dy is 
reduced into the irreducible representations Dy. But this 1s 
only correct to the approximation characteristic of perturbation 
theory. As we have seen above, an inner quantum number 
J can be rigorously ascribed to a term E; 1n the approximation 
with which we have been dealing here there 1s associated with 
it in addition the inner quantum numbers f, J} of the parts, in 
the last analysis of the electrons themselves: the energy level 
E arises from a definite term E;; of the unperturbed system by 
interaction of the two parts. Such an association 1s rigorously 
possible for “ simple states,” but the rules based on it lead only 
indirectly and approximately to an analysis of the terms. 


Examples 


If we take the Schrédinger scalar wave theory to be valid 
for a single electron, then a simple quantum state of the electron 
in the field of the nucleus is characterized by the principal 
quantum number n and the azimuthal quantum number 7 (we 
here use the word “azimuthal ” instead of ‘‘inner’’), Such 
a term is (2 + 1)-fold degenerate, and we assume there is no 
further accidental degeneration. The moment of momentum 
is represented by the operator & taken over from classical 
theory; the square of its absolute magnitude is (l + 1) and 
the signature has the value (— 1)'. If f electrons come together 
to form an atom we obtain a term, neglecting interaction between 
the electrons, 


E(nih) + Elna) ++ + + + E(nyzly) (2.4) 
of multiplicity (2, + 1) +++ (24 + 1). The quantum numbers 
n and l refer to the individual electrons. The interaction causes 


a separation which parallels the complete reduction, obtained 
with the aid of the Clebsch-Gordan series, of , 


D, X DX +: xX Dy (2.5) 


into its irreducible constituents Dz with total azimuthal quantum 
number L Each such term is associated with the quantum 
numbers 

(my Ly, nala > + e, nly; L). (2.6) 


If f 2 3 certain Dz appear more than once in (2.5), and we may 
therefore have several (2L + 1)-fold terms associated with the 
same set (2.6); these must then be distinguished from each 
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other by some further index. The square of the total moment 
of momentum is L(L + 1) and the signature (— 1)h +l +-+ +, 
In spectroscopy 1t 1s usual to characterize the values 1 = 0. 1. 2. 3 
4,- + + by the small Latin letters s, p, d, f, +--+ and the values 
L == 0, 1, 2, 3, © + + by the corresponding capitals S, P, D, F en 

We cannot expect the scalar wave theory to be correct 
but must be prepared to describe the state of the wave field 
in terms of a quantity % with several, say a, components 
(hi, Pa °° t Pa), Le. by a covariant quantity of a definite kind 
NW. Each component is a function of the spatial co-ordinates 
xyz; the components will depend on the choice of the Cartesian 
co-ordinate system in such a way that on going over to a new 
co-ordinate system by the rotation s the components will undergo 
among themselves that transformation A(s) which corresponds 
to s in the representation X. Again, consider b, replaced by u, 
as the fundamental group. The general component #,(xyz) of 
the ‘‘ vector ” % has two indices, the index « running from 1 toa 
and the index (xyz) running through all the points of space. 
Let R, be the vector space of functions ẹ(xyz) and R, the 
a-dimensional vector space; the state space of a single electron 
is then Re X RM, Under the influence of the rotation s which 
sends xyz into xy’s" the state % goes over into the state w’ 
defined by the equation 


palk y2’) = Z'asphalxyz), || dug |= Als) ; 


the representation induced in system-space is accordingly 
N= Wx E The moment of momentum M of the electron 
consists of two parts : 


M= (S x 1) + (1 x 9, (2.7) 


the first of which refers to the a-dimensional “ spin space ” Hy, 
the second to the ‘‘ translation space” R, (1 x L,), or simply 


. 1/7 d d . 
Ly, is the operator (153 — 55) which acts on each of the 
a components in the same way; it affects only the index (xyz), 
le. . . 
leaving the index « unaltered. ae is the unitary transformation 


corresponding to the infinitesimal rotation about the x-axis in 
the representation A; (S, X 1), or simply S, consequently - 
affects only the index « and leaves (xyz) unchanged. Only 
the part & appears in classical mechanics ; we call it the orbital 
moment of momentum, and the remaining part © the spin 
moment of momentum, or simply the spin. Its appearance 
is unavoidable so long as the wave quantity ẹ} is not simply a 
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scalar or a set of scalars. Each of the two parts satisfies separ- 
ately the commutation rules III, (15.7), but in general only the 
total angular momentum satisfies the law of conservation. If 
the quantity ¢ is of a simple kind, i.e. if QM is an irreducible 
representation ®,, then a = 2s + 1 and the spin © is equal to 
fe moment of momentum W, associated with the representation 

s- 

Since the Schrödinger theory has proved itself at least 
approximately correct, one should assume that to a first ap- 
proximation each of the components y, satisfies the Schrödinger 
scalar wave equation. So long as we consider this approxima- 
tion, the a components have only the effect of multiplying the 
multiplicity of each energy level by a. But in reality the correct 
differential equations must contain a term, the “spin per- 
turbation,’ which introduces a coupling between the various 
components p. The electron can thus be considered in 
abstracto as a composite system, consisting of the electron 
translation with system-space R, and the electron spin 
with system-space Ra; the spin perturbation is the weak inter- 
action between these two. Because of this the method of 
composition can here be applied. Let X = D, Decompose the 
translation space R, into the (2l + 1)-dimensional sub-spaces 
R(nl) ; the corresponding energy term E(nl) with azimuthal 
quantum number | has, on neglecting the spin perturbation, the 
multiplicity a(2l + 1) and its characteristic space is the space 
Ra X R(nl) of the same dimensionality. On taking the first 
order spin perturbation into account this term is separated 
into the terms E, with inner quantum number j and ‘multiplicity 
(2j + 1) in a manner paralleling the decomposition of the repre- 
sentation D, xX D; into its irreducible constituents: 


D, XD, = FD, jostistl—-1,--++,|l—s|, (28) 


with the aid of the Clebsch-Gordan series. Care must be taken 
to differentiate sharply between the azimuthal and inner quantum 
numbers Z and j. The latter is capable of assuming the values 
given in (2.8); whenever / 2 s the number of different terms in 
such a ‘“ multiplet” is 2s+-1. L? is approximately equal to 
the constant (l + 1), S? is approximately equal to the constant 
sls + 1), and M? is rigorously constant and exactly equal to 
jj +1). We can thus speak of the azimuthal quantum number 
of an actual energy term only to within the approximation 
characteristic of perturbation theory. It is well to set forth 
these considerations beforehand and to approach the spectro- 
scopic data, as we shall in § 4, with them well in mind. 
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§ 3. Selection and Intensity Rules 


We return to the consideration of our system as a whole, 
without resolving it into its individual electrons, and again 
denote the total inner quantum number by 7. Let A be any 
physical quantity of the system, and let it be represented by 
the Hermitian form A; we write that portion of this form in 
which R; intersects Rj, in the form 


Sa(mm')a(m)x’ (mn), (3.1) 


where the indices m, m’ run through the values 
m= j, j- I, sry — 4} m == 7’, 7 — 1, my — J. (3,2) 


If the quantity A ts a scalar, the operator A commutes with the 
operators U(s) induced in system-space by the rotations s. 
On decomposition into these irreducible sub-spaces Ry, Mj, it 
follows from the fundamental theorem III, (10.5), of the theory 
of representations that the section (3.1) of A corresponding to 
the transition R; >R, is zero if f +j and a multiple of the 
(27 + 1)-dimensitonal unit form 


Eilon) x'(m), 


ify =j. 

An analogous situation exists for the group b, of rotations 
about the z-axis, With respect to it the total system-space 
decomposes into 1-dimensional invariant sub-spaces R™ in 
which the rotation with angle ¢ induces the representations 
D : (m) > e(— md) x(m). If we only assume that the physical 
quantity A possesses axial symmetry about the z-axis it follows 
that the coefficient a(mm’) is necessarily zero when the magnetic 
quantum numbers m and m’ of the initial and final states are 
different. 

We now consider a vectorial quantity q with the three 
components qe, dy, gz instead of the scalar quantity A. This 
is of particular importance because such a quantity, i.e. the 
electric dipole moment q of the atom, determines the interaction 
between the atom and radiation—to that approximation in 
which the linear dimensions of the atom may be neglected in 
comparison with the wave-length of the emitted light. If the 
degeneracy of the energy level E; is destroyed by an external 
axially symmetric perturbation, e.g. a homogeneous magnetic 
field in the direction of the z-axis, then the spectral line caused 
by the transition R; — Rj, from the term E; to E’, is broken 
up into the lines associated with all possible transitions 


MOL AOE ee ee ee eee wae 
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(R;, m) > (Ry, m). On calculating the part of the Hermitian 
form representing the electric dipole moment in which the sub- 
space Ry intersects Ry : 


Zq(mm’)a(m)x'(m’), (3.3) 


the ratios of the squares |q(mm’)|® of the absolute values of its 
coefficients determine the relative intensities of these (27 + 1)(27’+-1) 
lines, Since qz is axially symmetric about the z-axis q.(mm’) = 0 
unless m = m ; we thus have the selection rule 


qz: m —> m (3.4) 


for the z-component of the electric moment. On performing 
the rotation with angle ¢ about the z-axis x(m), da F igy, Ja — igy 
are multiplied by e(—mø), e(¢), e(—9) respectively. Since 
i(m)x’(m’\ is therefore multiplied by e[(m — m'\b| we obtain 
the selection rules 
qe + igy: m>m—1, qa — iq: m>m+1 (3.4) 
for the x- and y-components of q. Only the transitions 
m > m—1, m m+i1 (3.5) 


of the magnetic quantum number are allowed ; the first and the 
last generate two waves which are circularly polarised in the xy- 
plane in opposite directions, and the remaining transition m —— m 
generates a wave which is linearly polarized in the z-direction. 
If the equation (2.3) holds for Zeeman effect, the wave number 
of the component m —> m’ is displaced by an amount o(m —- m’) 
from its unperturbed value. Thus in “ normal Zeeman effect” 
we obtain instead of (27 + 1)(27’ + 1) components only three, 
whose polarization is as described above and whose wave numbers 
are displaced by the amounts 0, +o. That the resolution of 
the two terms E; E}, is almost entirely hidden is due to the 
fact that the factor of proportionality ho in (2.3) has the same 
value for both terms. Fortunately most of the cases actually 
observed show “ anomalous Zeeman effect,” in which the resolu- 
tion of the terms can be seen clearly; in order to explain it 
we must change the expression (2.2) for the perturbation due 
to the magnetic field, But the above selection rule for the 
magnetic quantum number, which has been obtained from 
fundamental principles of group theory, is valid in all cases. 

The selection rule for the inner quantum number j is obtained 
in an analogous manner. The three components q», ly, qz of q 
suffer the transformation s among themselves when the x(mm), 
x'(m') are subjected to the transformations corresponding to 
s in the representations D, D, respectively. Or, if we wish to 
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express it in terms of Ug instead of bs, s is that transformation 
which is associated with the element ø of u, in the representation 
Dı This is, of course, merely an expression of the fact that q 
is a vector. Now, in accordance with the terminology intro- 
duced in III, § 14, (3.3) is a vectorial quantity in the representa- 
tion space of D; X Dy, and we are interested in determining 
how many linearly independent quantities of this kind there 
are. Their number is given by the number of times D, is 
contained in D; X D; or D, x D, as an irreducible constituent. 


But a accordance with (1.3) D, occurs in D; x Dy exactly 
once i 


J=j-—1 o j or j4+1 


and otherwise not at all, and we must further exclude the case 
j = 0,7’ = 0. We thus obtain the selection rule 


por gods 35. peed (3.6) 


with the proviso that O —> O does not occur. Since there exists 
but one linearly independent vectorial quantity in the repre- 
sentation space of D; x D, in the cases in which the selection 
rule is satisfied, the components of q(m, m’) are determined by 
purely group-theoretic considerations to within a constant factor 
of proportionality. 

In order to calculate the vectorial quantity (3.3) for 7’ = 7 — 1 
we proceed as follows. Let é, n; £, y’ be two arbitrary points 
on the unit sphere which transform cogrediently under u. 


EE +- yn’ is then the fundamental invariant, and the three 
forms which are obtained from 


l r — F 

pI ($E + īm)" (3.7) 
by multiplication with 

_ a 7, Si (3.8) 


transform in the same way as the (x + iy)-, (x — 1y)-, z-com- 
ponents of a vector, respectively. They are linear in the 
monomials y? of degree k + 2 = 2j and in the monomials 
ET’ of degree k = 27’. Introducing 


Ern’ ; ; E 
XM) = ee AM aay iar caren ia 
(mn) Vr! s! ve Vr! os’ | 
(Qmrtsak+2, m=r—s; Y =r +s =k, 
Im = r' — s') 


PERG mn a 


200 APPLICATIONS OF GROUP THEORY 


as co-ordinates in the representation spaces of D, D, we find 
that the three forms above are of the type (3.3) with 7 =J — 1. 
For example, we obtain for the (x + 1y)-component 


(£é’) (EEN m) Erne £’r-2p/s _ 
Coat = Ziv sl Vena VY 
(r— aw 


= — SVG F mG + män) (m — 1). 


In agreement with the selection rule m —> m — 1 there occur here 
only those terms for which m’= m — 1. Calculating the 
(x — iy)- and z-components in the same way, we find for the 
transition 


]J]—>7 =ņj— l: 


(de + igy)(m, m — 1) = — V (j + m) + m — 1), 
(qa — iqum, m + 1) = vV(j— mG —m— 1), (3.9) 
qa(m, m = v(i + m)(j — m). 


In order to calculate the components for the transition 7 = 7’ 
we must replace the factors (3.8) by 


wE, 2€'9, EEN 
which also transform like the (x + iy)-, (x — iy)- and z-com- 
ponents of a vector. Finally, for the transition 7’ = J + 1 we 
must replace (3.8) by 4’, — £”, €'n'. Since the angular mo- 
mentum Jt is a vector, the formule for the transition 7 >7 
must naturally agree with those already obtained for Mt [III 
(15. 9)), and since q is Hermitian the formulæ for the transition 
j—j+1 must agree with those obtained by taking the 


Hermitian conjugate of the components for the transition 
j>j—l. 


(qa + 14y)(m, m — 1) = V(j+m)\(j—m+ 1), 
(qa — igy) (m, m +1) = VG — m+ m+, (3.9) 
qlm, m) = m. 
j>f=j+1. 


(qe + iqy)(m,m—1)= vV(j— mtj — m+ 2), 
(qa — igy\(m, m + 1) = — VG +m + 1G +m +2), (3.9) 
g.(m, m) = vVG+m+IG— m+. 
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In each of these three sets of formule the right-hand sides are 
determinate only to within a common factor of proportionality 
which is independent of m, but which can be completely deter- 
mined only by integrating the wave equation of the dynamic 
model of the atom, and not by the theory of groups alone. 
The coefficients which do not occur explicitly in the above 
formule are all null. The squares of the absolute values of these 
coefficients yield the (rational !) intensity ratios of the components 
into which a line is split by the perturbation. 

Already before the rise of the new quantum mechanics the 
intensity formule (3.9) for the components of a line emitted 
under the influence of a magnetic field were obtained from the 
observational data under the guidance of the correspondence 
principle.6 In the new quantum mechanics they are, as we 
have seen, a consequence of the most general principles, and we 
would find ourselves in serious difficulties if they were incorrect. 
Nevertheless it 1s to be remembered that they can be invalid 
(1) if the spherical symmetry of the system is destroyed by 
external perturbing fields, or (2) if for short wave-lengths the 
interaction between matter and radiation is no longer determined 
primarily by the electric dipole moment. 

Since the dipole moment is a proper vector, as the components 
Qn Jy, qz FO over into — ge, — ly, — qz on reflection i in the 
origin, the representation Ð, induced on them by u has as 
signature — 1, If the signatures of R;, Ry are 6, 5’, then under 
the influence of the reflection 2 (3.3) is multiplied by the factor 
60°. The coefficients q(mm’) must accordingly all vanish unless 
50’ = — l; the selection rule for the signature is 


ô — — ô. 


If the individual electrons are governed by the scalar wave 
theory the total azimuthal quantum number L of the atom 
can jump only to L — 1, L or L +1, while the sum of the azi- 
muthal quantum numbers of the individual electrons h + lz +*+ ¢ + Ly 
can change only by an odd integer (Laporte’s rule). In the case 
of a single electron, f= 1, only the transitions l—> l + 1 are 
consistent with these rules ; this result has already been obtained 
in II, § 5, from the theory of spherical harmonics. 

The formule (3.9) allow us to solve a problem which we shall 
here, for the sake of future application, introduce from the 
physical standpoint. A partial system in the simple state N; 
is compounded with a second in the simple state Ry to form 
a single system. In Rj = R; X Rj, Ug induces the representa- 
tion D = D; x Dy; let the corresponding moment of mo- 
mentum be WM. On adapting the normal co-ordinate system 
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in R, to the complete reduction of D into its irreducible con- 
stituents Dy, M is broken up into square sub-matrices Wy of 
length 27-+1, arranged along the principal diagonal, corre- 
sponding to the decomposition of R, into sub-spaces Ry. But 
the same is not true of the moment of momentum W, x 1 of 
the first partial system, and we wish to determine the portion 
of this matrix in which Ñy intersects itself. That is, in physical 
language, we wish to determine the temporal mean value «Wt, - 
of the moment of momentum of the first system in the state 
defined by the quantum numbers j, 7’; J of the two parts and 
the whole. We assume that the interaction between the two 
parts resolves the energy level E,» into distinct levels Ey on 
applying the theory of perturbations. Since Ms; is a vector we 
know, from the same considerations as we applied to the electric 
dipole moment above, that the portion of it corresponding to 
the transition J —> J must be a multiple of Wy : 


<M, X 1>, = KJ’ My. (3.10) 


In order to evaluate the proportionality factor « we construct 
the scalar product of the matrices (M, X 1) and M; since 


M == (M, x 1) + (1 x M) 
these two matrices commute and we have 


(1 x M) = M + (M, x 1)? — AMM, x 1) 
2MM, x 1) =j +1) — 7'(7 $1) + Me, (3.11) 


for since in the original co-ordinate system (Mt, x 1)? was 
j(j + 1) times the unit matrix, it remains the same in the new 
co-ordinates. And, on the other hand, M(M, x 1) is equal 
to xz: J(J + 1) times the unit matrix in the sub-space Ry, as 
follows from (3.10). Hence from (3.11) 


MJT +1) =i 1) tatn 
1 iG+0-s¢' +] 
es (3-12) 


§4. The Spinning Electron, Multiplet Structure and 
Anomalous Zeeman Effect 


or 


Ky = 


We have hitherto ignored the fact that the terms of the 
alkali spectra, characterized by the two quantum numbers n, J, 
are in reality not simple. Each of these terms--with the exe 
ception of the s terms / == 0—actually consists of a fine doublet. 
By § 2 the (n, I) term should be resolved into 2 -+ 1 components 
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in a magnetic field; instead we find that one of the doublet 
terms breaks up into 2/ components and the other into 27-+2. 
We should accordingly ascribe to them the inner quantum 


numbers j = / — 7 joHl+e= 5» respectively. 


Our general considerations immediately give us a hint as 
to how this discrepancy is to be explained. The quantity ẹ% 
describing the wave field is not a scalar, but is instead a covariant 
quantity of the kind Dy, having two components (pı, Wa). This 
is the theory of doublet phenomena as developed by W. Pauli.® 
It seems indeed easy to arrive at this conclusion after the 
preparation of the preceding paragraphs, but historically this 
systematic foundation was developed only after Paulis dis- 
covery. It is quite immaterial whether we associate the matrix 
+ 1 or the matrix — 1 with the element c in the representation 
D, of ug. Taking the first of these alturnatives, the signature 
has the value (— 1)! in the quantum state (nly) ; hence Laporte’s 
rule remains rigorously correct on taking the spin into account. 
We have as further rigorous selection rules those concerning 
the total inner and the total magnetic quantum numbers. In 
the representation 2, the transformation o itself corresponds 
to the element o of u, and by III, (15.6), the spin moment of 


momentum is 56, where © is the vector already defined with 


l 0 
> al 

We shall not as yet attempt to find the specific effect of the 
spin perturbation on the wave equation. This was done origin- 
ally by picturing the electron as a small material sphere, the 
rotation of which gave rise to the spin; the additional moment 
of momentum required by spectroscopic observations was first 
introduced in this way by Goudsmit and Uhlenbeck.” Since 
S, is capable of assuming only the values + 1 it appears as if 
the spin axis can only be quantized along the positive or negative 
z-axis ; we need not go into the false conclusions this assertion 
can lead to on interpreting it literally. The spin perturbation 
must appear in going over from classical to relativistic mechanics. 
The terms of the hydrogen atom, calculated in accordance with 
the scalar non-relativistic wave mechanics, depend only on the 
principal quantum number n, but the theory of relativity intro- 
duces a correction which causes the terms corresponding to the 
various values of } to split apart and form the so-called fine 


components 


01 0 -i 
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structure. We should therefore expect the same scheme of 
terms in hydrogen as in the alkalies, but observation shows 
that the doublet separation of an / term into two terms with 


=li - is just such that two terms with the same j, but with 
different l = j + z exactly coincide. Hence the spin per- 


turbation in hydrogen agrees quantitatively with the separation 
caused by the relativity correction. 

The alkali doublets show anomalous Zeeman effect. Other 
elements, such as alkaline earth metals, have (in addition to 
triplets) a system of singlet terms, and singlet terms always 
show normal Zeeman effect in a magnetic field. It therefore 
seems probable that the anomalies in Zeeman effect are closely 
connected with the spin. The magnetic separation of an alkali 
term is quite independent of the principal quantum number 1; 
all the terms of a series behave in the same way. A term (l, 4) 
splits up into 27 + 1 equi-distant components, characterized by 
the magnetic quantum number m, but their separation is hog 
instead of ho, where g is a rational function of l andj (the ‘‘ Landé 
g-factor”). The energy value of the component m is therefore 
displaced by an amount 


hogem (m=j,j7—1,** +, — J) (4.1) 


from its unperturbed value. The empirical formula for the factor 
g, which is due to Landé, is 

+l 

STUFT 


(4.2) 
This formula holds for weak magnetic fields, in which the separa- 
tion is of a smaller order of magnitude than the doublet separation. 
I= 0,] = > we have in particular g = 2. 


This latter fact gives a hint toward the solution of the puzzle : 
If the total moment of momentum consisted only of the spin 
(g = 0), its magnetic effect would be twice as great as if it con- 
sisted of & alone. We therefore assume that the magnetic effect 


ole.. 
of the spin 5 © 1s twice as great as that of the orbital angular mo- 


mentum &; the perturbation due to an external magnetic field 9 
1s therefore to be taken as 


_ ch y ch | l 
W= 500 +O OMS (43) 
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The spin offers an explanation of why the beam in the Stern- 

Gerlach experiment 1s separated into two paris. The valence 

electron of the univalent silver atom is, in the normal state, 

. . . I 

in an s-orbit (l = 0); hence 7 = 5 and m can assume only the 
l . 

values + 5° Although the component of the mechanical 


moment of momentum in the direction of the magnetic field 
h . 
can have only the values + 5’ the experiment shows that the 


value of the magnetic moment of the atom is a whole Bohr 
magneton, and not the half of one; but we now see that since 
the mechanical moment of momentum consists only of spin 
it should give rise to twice the expected magnetic moment. 
The connection between magnetic moment and mechanical 
moment of momentum is even more apparent in the magneto- 
mechanical effect: the demagnetization of a vertically suspended 
bar of weak iron must result in giving to it an angular momentum. 
The ratio between the change in the magnetic moment and the 


7 e . 
moment of momentum was expected to be TA but the experi- 


ment, which was performed only on ferro-magnetic bodies, 
yielded twice this value. The anomalous magnetic behaviour 
of the spin also accounts for this result, if we assume that the 
mechanical moment of momentum in ferro-magnetic substances 
is due entirely to the electron spin.® 

= Does this hypothesis also explain the general Landé formula 
(4.2)? This is answered by the formula (3.12) obtained toward 
the end of § 3, in which 7, 7’, J must be taken as z: l, J in order 
that it apply to the composition of electron spin and electron 
translation. We find that in the state (lj) the temporal mean 


value of the spin T is equal to W multiplied by the factor 


= 1 Ẹł— (l+!) 
e—1=3 t RG ED 
or 
—1= + for jal4h (4.4) 
2+- 1 2 
Hence by (4.3) 
eh 


W> = gie £ (OM) = hog: M.. 
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So long as the magnetic separation is small compared with the 
spin perturbation the Zeeman separation of the term (Į) is 
determined primarily by <W); (4.4) then leads, in fact, to 
equation (4.2), in agreement with the empirical data, | 

If the atom consists of several, say f, electrons, the situation 
then arising can be understood with the aid of the general rule 
of composition. If the electrons are in quantum states with 
inner quantum numbers f, and energy levels E(j,), (ry = 1, 
2, +», f), then on neglecting the interaction between the electrons 
the total system has a (2j,-++ 1) ++ + (27, + 1)-fold energy level 
Elh) ++ +++ E(j;). TE this level coincides with none of the 
other levels it is resolved by a small perturbation into terms 
with total inner quantum numbers / in a manner corresponding 
to that in which the product 


Di, X DX X Dy = ED (4.5) 


is reduced into its irreducible constituents Dy (Clebsch-Gordan 
series). Obviously in order that this (77) coupling lead to an 
adequate description the mutual interactions between the 
electrons must be small compared with the spin perturbation. 

The situation usually met is, however, the opposite of that 
contemplated above: the normal term order corresponds to 
the Russell-Saunders or (sl) coupling. Neglecting for the moment 
the interaction between the electrons as well as the spin per- 
turbation, we are led to a 2/(21,-++ 1) + + - (2L, + 1)-fold energy 
level (2.4) in whose characteristic space the rotation group in- 
duces the representation 


DS X (Di, X Dp X X D,,). (4.6) 


Due to the interaction between the electron translations the 
second factor is reduced in a manner analogous to (4.5); a 
single term with azimuthal quantum number L has now the 
multiplicity 2/(2L + 1). We next reduce 


D = È Dy (4.7) 
and finally, as the last ‘step, we carry out the reduction 
D, X D = TDy, J=L+s,L+s—1, C. |\L — sl), (4.8) 


associated with the coupling between the spin and the orbital 
moment of momentum. The terms which result from this 
last reduction form together a multiplet. Each multiplet is 
therefore associated with a definite azimuthal quantum number 
L and a spin quantum number s; the individual members of 
the multiplet are distinguished by the inner quantum number J. 
We call 2s + 1 the multiplicity, although the number of terms 
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in the multiplet is only actually equal to this when L => 5s, as 
by (4.8) their number is less if L <s. The -dimensional 
representation Dj is even or odd according as f is even or odd. 
The reduction (4.7) into irreducible constituents accordingly 
yields only integral values for s when f is even and only half- 
integral values when f is odd: The term multiplicities alternate 
regularly between even and odd as we run through the atomic table 
in the order of increasing atomic number (H even, He odd, Li 
even, Be odd, etc: “ alternation law”). For f= 2 we have, for 
example, | 


D = Do -+ D. 


It is empirically found that the bivalent alkaline earth metals 
have in fact a singlet and a triplet system of terms. Butin the 
triplet system the S terms, for which L = 0, are simple; only 
the P, D,» - + terms have the actual multiplicity 3. 

Instead of considering all the electrons at once as in (4.6) 
we can build up the atom by successively adding one electron 
after another. On adding a next electron, say the ft*, to an 
atom or an ion A*, a multiplet of At characterized by azi- 
muthal quantum number L and spin s breaks up into all those 
multiplets contained in the representation (Ð, x D) x (Dr x D)), 
where J, = lis the azimuthal quantum number of the electron 
added. Since 


>, X Dy = Derg + Dej 
Di X D= SD, L*=L+1, L+i—1 +--+, Li, 


this results in multiplets (s*, L*), one for each of the pairs 
=a I*=L+, L+1—1, +, |L—I] (49) 


( branching rule’’). The alternation law is again contained in 
the first of the above equations. It is to be noted, however, 
that the Pauli exclusion principle for equivalent orbits, which 
will be discussed in part C of this chapter, materially restricts 
the array of multiplets allowed by this rule.’ 

Again applying (3.12) to the composition of spin and orbital 
moment of momentum, we find that the 2J + 1 components 
into which a J term of a multiplet (s, L) is split in a weak magnetic 
field are displaced from the unperturbed positions by the amounts 


hog:m (m=J]J,]J—1,:: — J) (4.10) 
where the separation factor g is given by 


g=1+ i+) Se (4.11) 
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This is exactly the formula which was derived empirically by 
Landé ; we here see the importance of the fact that the square 
of the absolute value of the moment of momentum § (or Q or © 
is calculated from the quantum number J (or Lor s) by J(J + 1), 
etc., instead of J?, etc., as in the older quantum mechanics. 

When the magnetic field increases to such an extent that the 
magnetic separation becomes comparable with the separation 
between the terms of the multiplet we must handle both the 
perturbation to which the multiplet separation is due and the 
magnetic perturbation together. In order to express the small. 
ness of the term in the Hamiltonian function to which this 
former perturbation is due, we introduce a factor p which will 
appear in the same way as the factor o in the magnetic term; 
the case of a weak magnetic field may then be expressed by 
saying that o is small in comparison with p. We can consider 
o and p as variables which increase gradually from 0 to their 
actual values and follow the dependence of the separation on 
their ratio. We therefore write the perturbation term in the 
Hamiltonian function in the form 


W = pW' + oW”. 


Since the decomposition (4.8) need not for present purposes 
be expressed in terms of its ultimate constituents, the individual 
electrons, we may here denote the azimuthal and inner quantum 
numbers by l and j. Let the representation spaces of D, D, 
be ta &, with co-ordinates &(m,), x(m,) respectively. Denote 
the moments of momentum M., WM, of these two representations 
by 8, & respectively ; if the magnetic field has as its direction 
the z-axis, then 


W” = h(L, + 2s,). (4.12) 
The co-ordinate system is again to be so chosen that the rotations 


about the z-axis appear in reduced form; to such a rotation 
of angle ¢ corresponds the transformation 


Elma) > e(— mh) + Elme), x(m) > e(— me) + x(m:) ; 
the range of the quantum numbers m, and m, is given by 
5 —1. (4.13) 


The variables of r, X R, then behave like the (2s + 1)(2/ + 1) 
products 


M: =S,S— l, —s; m=Ll—1l, 


Elms) + %(m)) (4.14) 


and are multiplied, under the influence of a rotation $ about 
the z-axis, by e(— md), where 


m = mM, + m, 


a- Oe a arerp ee y 


Ji ee AEA me e o- o 
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We now reduce D, X D, into its irreducible constituents ®;. 
Yet the co-ordinates of the (27 + 1)-dimensional.irreducible sub- 
space of t X R, in which the representation D, takes place, 
be denoted by 


xj; m) (m=j,j7—1, +++, — j). 


am is the magnetic quantum number, i.e. under the influence of 
the rotation ¢ about the z-axis x(j ; m) is multiplied by e(— mẹ). 
The co-ordinate transformation which leads to the complete 
reduction of D, X ®, into its constituents D; is obviously of 
such a kind that «(7 ; m) is a linear combination of those of the 
wariables (4.14) for which m, + m, has the value m. 

If the unperturbed system possesses no accidental degenera- 
tion the separation is determined by that part of the matrix 
(4.12) in which the sub-space ts X R, of RI intersects itself. 
We must therefore solve a secular equation G of degree 
(2s + 1)(2 + 1); but the problem is materially simplified by 
the fact that the perturbation term possesses rotational symmetry 
about the z-axis, as the only non-vanishing elements of the 
matrix W are those for which m > m. The one secular equation 
G is consequently broken up into 2(/ + s) + 1 secular equations 
Cm corresponding to the possible values 


m=I-+s,l+s—1,-+-+,—(1+s) 


of m. The degree of Gm is given by the number of possible 
Partitions of m into two summands m, + m, which run through 
the ranges (4.13). In the case of a single electron, f= 1, we 
have only equations of the first and second degrees, and the 
calculation can therefore be carried through completely for this 
case. ! 

The roots of the secular equation Gm are the displacements 
of the energy terms due to the perturbation. Since the trace 
of a matrix is an invariant, the sum of the term displacements 
which are associated with a definite value m of the magnetic 
quantum number (the roots of the secular equation Gm) is equal 
to the sum of the terms in the principal diagonal of this portion 
of W, i.e. to | 

W(ms m, m,m). 
(m, + my = m) 


It is therefore a homogeneous linear function of p and o (“ sum 
rule”). We obtain the part due to the magnetic field by putting 
P = 0; by (4.12) this is 


oW” (m,m, m, m,) = ho(m, + 2m,). 
14 
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On the other hand, the formule (4.10), (4.11) determine the 

term displacements in the case in which o is small in comparison 

with p. In consequence of the sum rule these two results must 

agree. land s being fixed once and for all, we denote the Landé 
g-factor (4.11) by g(7), and we then have 


Dim, + 2m.) = m Xgl). 


The sum on the left is extended over all partitions of m=m; +m, 
for given m, and that on the right over all values of 7 which 
are consistent with the conditions 


j= |m], [m| +1,1; fHltsl+s—1,--+, E sl. 


g(j) can in fact be determined from this equation. For m=l+s 
both sums reduce to a single term ; we then have 


l+ 2s = (l+ s)-g(l+s). 


For m == l + s — 1 there are two possibilities for (ms, m,) and 
two for j: m, = l, m, =s — 1 or m =1— l, m, =s; J =l--s 
or l+ s— 1. Consequently we must have 


2l + 4s — 3 = (l + s — lj{g(l + s) + gl + s — 1)} 


In this way we obtain recursion formulæ for the successive 
calculation of g(l + s)ù, g(l+s— 1), +». The reader can 
readily verify that the result of the first few steps agrees with 
4.11). 

i is to be noted that in following the terms from a weak 
to a strong magnetic field they cannot cross each other, con- 
sidered as functions of the monotonic increasing parameter 
o:p; the “singular elements’’ of a unitary group, i.e. those 
elements for which two or more characteristic values coincide, 
constitute a manifold of three, and not simply one, fewer 
dimensions. 


B. The Lorentz Group 


§ 5. Relativistically Invariant Equations of Motion of 
an Electron 


We have as yet obtained no specific expression for the spin 
perturbation; that for the magnetic effect due to an external 
field was set up with the aid of the experimental facts. It is 
clear that we can arrive at a satisfactory theory of the electron 
only when we are able to express its fundamental laws of motion 
in a form which is invariant under Lorentz transformations, as 
required by the restricted theory of relativity. The solution of 
this problem is due to Dirac. We saw in III, § 8, how the 
2-dimensional representation , of the rotation group, which, 
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following Pauli, characterizes the covariant quantity b == (p, Ya) 
describing the wave field, can be extended to the group of 
positive Lorentz transformations. Yı, %, play the same rôle as 
the variables é, ņ introduced in connection with D}. 

Following de Broglie we took as the wave equation of a 
particle of mass m in field-free space 


92 92 d? 1 d? 2 cm 
Gatta aat (m=). 6 


But this equation is not in agreement with the general scheme 
of quantum mechanics, which requires that only first order 
derivatives with respect to the time appear. The formulation 
of a relativistically invariant differential equation satisfying 
this requirement is, as Dirac discovered, made possible by the 
transition from the scalar wave function ẹ% to one with two 
components. We seek to derive these dynamical equations 
from a Hamiltonian principle. 
Let 


Xo = ct, XS Y, Xa =y, Xe 8 
constitute a normal co-ordinate system in our 4-dimensional 
space-time. If the quantity w is of the same kind as y, the 


quantities ý Saw behave, in accordance with III, (8.16), like the 
four components of a 4-vector; the S, are the matrices defined 
in ITI, (8.15). Hence in particular 


3 o o| oy 
Sy —dx 
mov 0% 8 p 


are the components ds, of an infinitesimal vector; we are here 
dealing with a linear correspondence which is independent of 
the co-ordinate system employed and which sends the vector 
dx over into ds. Its trace 


jst (5.2) 


is consequently a scalar and its integral (multiplied by 1/7) 


M = ; | Dx Sa xæ «dx (dx = d%o dx, dx, dxs), (5.3) 


extended over any finite portion of the world, is a quantity which 
is independent of the co-ordinate system.” 


* The letter M used for the material part of the action is not to be confused 
with the moment of momentum. 
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Although M may not be real, it is practically real in the sense 
that M — M is the integral of a complete divergence. For 
since the S, are Hermitian matrices, 


— Iode, 
M=— \s2s.y dx 


and M — M is in fact the integral of 
1 2(b S, 
120 Sa) 


15 a 

In using M as an action we are not interested in M itself, but 
only in its variations 5M caused by arbitrary infinitesimal 
variations ò% of % = (fı, $a) which vanish outside of a given 
finite portion of the world (the integral is then extended over 
the entire world or, what amounts to the same, over this finite 
portion). The circumstances mentioned above guarantee that 
6M is real; on writing it in the form 


ôM = (w + ©- Sib)dx 
we find on comparison with (5.3) that 
_ ly, 
w = 7 2 Seay 
We thus arrive at the first order differential operator 


ð 
V= OETA (5.4) 
From the invariance of (5.2) it follows that this operator trans- 
forms } = (fı, Pa) into a quantity y = (pi, Y2) which trans- 
forms contragrediently to % = ($, a) under the influence of 
an arbitrary positive Lorentz transformation. If we wish to 
guarantee that M is real, we may replace the original definition 
by 


o 1l ~o db W 
M = zl 2 (PSSE — ES p) dx. (5.5) 

In HI, § 8, we found it necessary to introduce quantities 
Wi Y2 which transform contragrediently to J, Ý in order to 
be able to extend the restricted Lorentz group to the complete 
group. And just as V applied to % generates a quantity of the 
kind y’, in the same way the “ conjugate ” operator 


Var! 


a Xs 
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transforms #’ into a quantity of the kind y. V'V is, as is readily 
verified, the operator 


(542 ua 
dri Oe dE 


Consequently equation (5.1) for p, P can be written in the form 


1 
7 Ve + My W = 0, 

i (5.6) 
FVW + Mo ub = 0 


on introducing an auxiliary pair of components Y. From now 
on we denote the column of the four components By, Pa; Pi Pa 
by % and employ S« as the symbol for the transformations of 
these four components as in the latter part of Chapter III; 
with this understanding the differential equations (5.6) arise 
from an action integral which is composed additively of the 
quantity M, (5.3), and the invariant (cf. III, (8.19) | 


MW = mo| pT - dx. 


M and M’ are also invariant with respect to interchange of 
right and left, and under the spatial reflection i in the origin. 

In accordance with the general scheme of quantum mechanics 
the differential equations for ẹ should, as already remarked, 
contain only the first derivative of 4% with respect to time; the 
additional requirement that it be relativistically invariant then 
leads to the conclusion that it can also contain only first de- 
rivatives with respect to the spatial co-ordinates. We have 
here been able to satisfy these requirements without altering 
the actual content of de Broglie’s equation (for the components 
Pa Ya); the equations thus obtained are to be taken as the 
equations for a free particle. This formal transition to first 
order equations will become physically significant only when 
we pass to the derivation of the equations of motion in an electro- 
magnetic field with the aid of the principle of gauge invariance 
developed in IT, § 12. , According to it, if — ¢, is the scalar and 
$1, $2, $3 the vector potential, we must replace 


d 
OX 


“j — 


1 2 € | 
by int he? (5.7) 
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It will be found convenient in the following to introduce the 
quantities f, obtained by multiplying the potentials ¢, by the 


factor E Then in 


hc 
M = z| Vde (5.8) 
the operator V is defined by 
l 1 2 
V= VAE st fa). (5.9) 


Because of this gauge invariance the quantities M, M’ are 
unchanged on replacing simultaneously 
p by er and fa by fa—2 (6.10) 


where A is an arbitrary function of position in space-time. Now 
take A to be an infinitesimal function which vanishes outside 
a certain finite portion of the world; then 8M and 5M’ must 
automatically vanish for the variations | 


òb = iÀ tp, if, = ——. (5.11) 
The complete expression 
3(M + M’) = [[(8h-@ + B+ BY) + Ls dfaldx 


for the variation automatically tells us that under the assumption 
that the laws of matter (5.6) are satisfied, i.e. that w = 0, 


8(M + M’) = | Xs*f, + dx. 


Hence we have as a consequence of the laws of matter 


a. OA _ fy. pS O 
_ [zs Z de = fa Zdr = 0, 
i.e. the continuity equation 
yi = 0. (5.12) 


a DK 
A glance at the explicit expression for M shows that 
st = bSy h ; (5.13) 


these are the quantities which formed the starting-point for 
the theory of the transformations of } as developed in III, § 8, 
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and we already know that they form the components of a 
4-vector which is independent of the particular space-time 
co-ordinates employed. The time component 


so = ph = (baths + haha) + (Wi, + dos) (8-14) 


is the probability density and hence c8 = c(s!, s?, s?) is what 
may be called the probability current: in order to obtain the 
number of particles which will on the average pass through 
a surface element do in time unit, multiply the total number of 
particles present into the product of the area do and the normal 
component of the vector c3. On integrating the equation (5.12) 
over a volume V we find that the increase in the mean number 
of particles in V per unit time is equal to the mean number of 
particles entering V through the surface in unit time. In 
contrast to the provisional scalar theory, the Dirac theory leads in 
a most natural way to expressions for the probability density, as 
well as the probability current, which depend on & alone. 
On integrating 


fs dx, AX» dx 


over the whole of space we find that the integral is independent 
of time—and, in accordance with the statistical interpretation 
of y, is to be so normalized that its value is 1. Consequently, 
in the dynamical law 


the energy H/h is a Hermitian operator, as should be. We 
shall from now on take h as the unit of action, with corresponding 
units for linear and angular momentum. The result of this 1s 
that the quantity k disappears completely from the laws of 


quantum mechanics. With the usual abbreviation, pa = ae 
1 3 
Hi = Jo tA S, (pr + fe) + mo T. (5.15) 


The influence of the electro-magnetic field on the matter is 
taken care of by (5.9), but, on the other hand, the matter gener- 
ates the electro-magnetic field in accordance with Maxwell’s 
equations. In order to express this explicitly we must add to 
M + M’ the Maxwellian action 


= [Uh + fi +f) — (te + fo + flix (6.18) 


216 APPLICATIONS OF GROUP THEORY 
of the electro-magnetic field, where the 


fg = h Hf 
es OX BET 


are the field strengths—which are unaffected by the change of 
gauge (5.10). F is obtained from 


5| (or — enavae (5.17) 


2 2 g 
by multiplication with e(%) =<, (6.17) is the action in 
Heaviside units, which are best adapted to the electro-magnetic 
field theory. Since we have taken h as the unit of action, the 
total action of our system, consisting of matter plus field, is 


w=M+m'+iF ( = 5) (5.18) 
= M+ + z c= 7): . 
For reasons which will be apparent later the real number «/42r 
is called the fine structure constant. Whereas the variation 
of the yin the Hamiltonian integral [W - dx yields the equations 
of matter, variation of f, leads to the equations of the electro- 
magnetic field with 


— es = e Sa (5.19) 


appearing as the 4-vector of charge and current density. The 
only constants occurring in the field equations are the two 
combinations 


cm e? 
M = --; o= 


(5.20) 


of fundamental atomic constants: the first is a reciprocal 
length and the second a pure number. 

Schrödinger, in his fundamental papers on wave mechanics, 
thought he could explain the quantum behaviour of matter 
and radiation “ classically ” by setting up a closed system of 
field equations such as we have obtained above. In particular, 
he held that the charge of the electron was actually “ smeared ” 
over the whole of space with the density —e-s®, But there can 
be no doubt at the present time that the field equations are not 
to be interpreted in this classical manner; they must rather 
be interpreted in accordance with the statistical view-point 
developed in Chapter II. The expression (5.14) for the density 
then guarantees the atomistic structure of electricity. To show 
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this we first remark that the charge in a volume V is represented 
by — e times the Hermitian form 


{ j J dobdexdacydarg, 

(F) | 
But this is an “ idempotent form with respect to the “ vector ”’ 
y%; its characteristic values are 1, 0 and the corresponding 
characteristic functions are those quantities % which vanish 
outside or inside V, respectively. The charge contained in V 
is accordingly capable of assuming only the values — e and 0, 
i.e. according to whether the electron is found in V or not. In 
order to guarantee the atomicity of electricity the electric 
charge density must equal — e times the probability density. 
But if we base our theory on the de Broglie wave equation, 
modified by introducing the electro-magnetic potentials in 
accordance with the rule (5.7), we find as the expression for the 


charge density one involving the temporal derivative OY in 


addition to #; this expression has nothing to do with the prob- 
ability density and is not even an idempotent form. According 
to Dirac this is the most conclusive argument for the stand 
that the differential equations for the motion of an electron in 
an electro-magnetic field must contain only first order derivatives 
with respect to the time.!3 Since it is not possible to obtain 
such an equation with a scalar wave function which satisfies at 
the same time the requirement of relativistic invariance, the 
spin appears as a phenomenon necessitated by the theory of 
relativity. 

The theorem of the conservation of electricity (5.12) follows, 
as we have seen, from the equations of matter, but it is at the 
same time a consequence of the electro-magnetic equations. 
The fact that (5.12) is a consequence of both sets of field laws 
means that these sets are not independent, i.e. that there exists 
an identity between them. The true ground for this identity 
is to be found in the gauge invariance, for it is equivalent to 
the assertion that 5W vanishes identically when ẹ and f, are 
subjected to variations of the form (5.11). We have 


BW = f{(Bh-w + a- 8) + SL” fajde, 


where w= 0 are the equations of matter and L*=0 the 
Maxwellian equations. On substituting the variations from 
(5.11) and integrating the last term in the integral by parts, 


kanai 


l ~ oL* 
[($o— oY) + I= =0. 
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Because of the arbitrariness of the gauge the number of inde- 
pendent equations must be one less than the number of unknown 
functions ý and fa. 


§ 6. Energy and Momentum. Remarks on the 
Interchange of Past and Future 


I. Energy and Momentum. 

The complete field equations are explicitly 
l? : 

ESG se + fa) + ma TH = 0; | 

p 1 WX, (6 1) 


div € + p= 0, © ewig —a.| 
0% 


Where © and § are the electric and magnetic field strengths : 


UA A)... pp Ae. a is, ga 
E,= (3-3), m=: A r (6.2) 


p is the charge density np, and the components $s, - > - of the 
current 3 are given by 


S1 = ý Sý, RN (6.3) 
In addition to the differential law 


op EN 
da + div 3 = 0, (6.4) 


expressing the conservation of electricity, we have a vector con- 
servation law governing energy and momentum. A completely 
satisfactory expression for the tensor representing density and 
flux of energy and momentum is only to be obtained along the 
lines employed in the general theory of relativity. Here we 
give only the result for the density of energy — c- # and mo- 
mentum (4, #, 4), and in doing so we separate the material 


from the electro-magnetic part. We have for the part referring 
to matter 


~8= 3 2 {F5(g + A- (2 as): 539} 
i T + mo JTH; (6.5) 
=) ’ ’ 
a a 


We have here introduced, in addition to Sp, the operator S, 
(p = 1, 2, 3) which acts on all four components of #; whereas 


-ë 


one oe q 


mn ea a^i 
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the former subjects %,, %, to the 2-dimensional transformation 
Sp [III, (8.15)} and yi, %2 to — Sp the latter exercises the 
same 2-dimensional transformation S, on both pairs of com- 
ponents. Correspondingly 


s, = YS. 


The density of energy and momentum due to the electro-magnetic 
field is given by the familiar Maxwellian expressions 


-8 = S{(El+ H + (H + +)}; 


(6.6) 
t = «(EH — EH), + © «. 
We find the conservation laws 
3 pf 3 d 
2s = 95 ino %q_ °, (6.7) 


as consequences of the field equations. Furthermore, the tensor 
t is symmetric—not identically, but in consequence of the field 
equations; in this sense we have 


f+ = 0 (p= 1, 2,3); f= (p,¢=1, 2, 3). (68) 


On combining these with (6.7) we obtain the divergence con- 
ditions 
>, (Xats — Xa tz) 


pA TA = 0t. :; (6.9) 
8 æ æ 

> 2(%q ty + avy 19) — 0, oa. (6.10) 

aml ÒX x 


These results can all be verified directly, but their deeper 
significance can be understood only by going over to the general 
theory of relativity as mentioned above. Just as the theorem 
of the conservation of electricity follows from the gauge in- 
variance of the equations, the theorems for the conservation 
of energy and momentum follow from the circumstance that 
the action integral, formulated as in the general theory of 
relativity, is invariant under arbitrary (infinitesimal) transforma- 
tions of co-ordinates. In this general relativistic formulation we 
need further to erect a normal set of co-ordinate axes at each 
point P of space-time, consisting of four mutually perpendicular 
directions at P (‘‘ orthogonal ennuple”’), in order to fix the 
metric at P and to be able to describe the wave quantity # in 
terms of its components; all permissible orthogonal ennuples 
at P are obtainable from each other by local Lorentz transforma- 
tions which leave P invariant. But the rotations of these local 
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ennuples can be performed in the various points P quite inde- 
pendenily—the quantities at various points are not bound to 
each other as in the special theory of relativity. The symmetry 
of the energy-momentum tensor can be traced back to the 
invariance with respect to such rotations. One can in fact 
take it as a general rule that every invariance property of the 
kind met in general relativity, involving an arbitrary function, 
gives rise to a differential conservation theorem. In particular, 
gauge invariance is only to be understood from this standpoint. 
It follows from the transformation laws for % that its four com- 
ponents #, relative to the local ennuple are determined only to 
within a common factor e'4 of proportionality, the exponent A 
of which depends arbitrarily on position in. space-time; in 
consequence of this it is necessary, in order to obtain a unique 
covariant differential for %, to set up a linear form Df,dx, which 


is coupled with the gauge factor contained in % in the manner 
required by the principle of gauge invariance." 

We obtain the integral conservation laws from the differential 
ones by integration. We set up the integral 


fdv = J, (dV = dx, diy des) 


over a section % = const. of space-time and find that it is 
independent of x. —cjJ =H is the energy and (J, Js, Js) 
the linear momentum. The material part is, on a simple in- 
tegration by parts, i 


— Ji = Hz Sh a + fp) + mo ya ; 
I= fE- z Sar, cate 


These are Hermitian forms in the “ vector ” $. They again lead 
. 1/d> d a 
us to associate the operators = (S, —, — 
1 DXI We IX 
(Ji; Ja Js) of linear momentum, i.e. to the assumptions with 
which we, following de Broglie and Schrödinger, began. For the 
energy we obtain (on dividing by c) the operator 


SH = 2 S,(1 LA + fo) + moT, 


c 1 Xp 


without the additive term f, as in (5.15) ; the differential equa- 
tions of matter are therefore 


G so + fojt + THY = 0. 


) with the components 
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Moreover, we must not forget that to the part due to matter 
we must yet add that due to the electro-magnetic field. 
The quantities 


My = J (xq — x DdV, + + +, (6.11) 


which are by (6.9) also constant, are the components of the 
moment of momentum. We find from (6.5) that the part due 
to matter is 


— (Fix, 2 XV lo 
M, = (4; (a — 5) + 5Si}yar, cos, 


In agreement with our earlier assumptions we here obtain the 
operator which 1s composed of the sum of the x,-component 
* (x4. — rsz) of the orbital t of d th 
i 3x, ETA al moment of momentum and the 


spin moment of momentum zS. The vector 
1 b l ? ? LU 
9 © = 9 (Si, So, S3) 


is actually the spin, for in accordance with the law of trans- 
formation of both # pairs (pı, Ya), (Yi, Y3) of components suffer 
the same transformation o as in the Pauli theory of the spin 
under the influence of the transformation ø (spatial rotation) 
of U. 

On integrating equations (6.10) over the spatial section 
xə = const. we obtain | 


d 
Ji = — gp V, + a, 


which we may consider as the law of inertia of energy. The 


integral may be written Jeo’ £ = — a, é, where &,, &, & are 


the co-ordinates of the ‘ centre of energy’’; the equations are 
then 


dy 
Si = 3p 


We thus obtain the familiar mechanical law: Momentum 1s 
equal to mass times velocity, where the velocity 1s to be taken as 
that of the centre of energy and the mass as 1/c? times the energy 
content of the field. Nevertheless it is advisable not to divide 
by H in defining the centre of energy, as the energy density 
— £ is here no longer positive-definite, and we cannot be certain 
that the energy content H will turn out to be positive. 
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Our theory is a classical field theory, the quantum features 
entering only in the statistical interpretation. With this 
interpretation the field laws are concerned with a single electron. 
At the present stage of our development we can deal only with 
the additional quantities due to the electro-magnetic field by 
assuming a given external field affecting the motion of the 
particle, without the particle reacting on the field; we must 
then surrender our Maxwellian equations. The true laws 
governing the interaction between electrons and quanta will 
only be obtained, in analogy with II, § 138, on subjecting the 
system of field equations to the process of quantization, just 
as was done by Heisenberg for any system of classical mechanical 
differential equations. 

The fact that we are led back to our original assumptions 
concerning the operators representing position and momentum 
is due to the particular expressions we have chosen for the 
action, from which the field equations were obtained; indeed, 
it depends entirely on the part M. These original postulates 
of quantum theory are accordingly of less interest from the 
standpoint of general principles than we at first believed. But, 
on the other hand, this connection seems to indicate that M 
cannot be replaced in its rôle as representing the action due 
to matter. M is also responsible for the fact that the charge 
and probability densities agree, which is unconditionally re- 
quired as a guarantee of the atomistic structure of electric 
charge. These connections with the most fundamental physical 
observations thus require that the action be composed additively 
of M and further terms which are invariant not only under 
change of gauge (5.10) as is M, but also on replacing w by e*4- and 


fa by fa — ot, where A and p are two independent arbitrary 


functions in space-time. J’ and the Maxwellian action F are 
in fact of this kind. Further relativistic invariant scalars 
satisfying these conditions are readily found—indeed it is not 
difficult to set up the most general action possible with the 
quantities at our disposal. But we have yet to be convinced 
by physical observation that the three quantities M, M’, F 
here employed do not suffice. 


II. Electric and Magnetic Spin Perturbations. 


In order to be able to compare Dirac’s theory with the facts, 


we eliminate yi, Y3 in the same way as we did in the absence 
of the electro-magnetic field. We obtain the equation 


— Vb = mip 
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with the new definition (5.9) of V and V’. The substitutions 
S, in two variables satisfied the equations 


SSi = Siso = S1; S253 = — SaS = 1S; ; 


and consequently those denoted by the same letters but operating 
on all four variables obey 


Sosi = Siso =: Sy ) SaS == == — SaS == = 1S}. 


V’'V contains terms of the following four types ; 

0) (S-+if) (+i), 

a -ENEH 

(8) Attn EnEn) 


TS b ò . 
(8) — A554 (5, tA) Cae, +) — (Se +) Gat 
We collect together terms of types (1) and (2) to form the 


“ regular term ’’ in which the components of # are not coupled 
with each other: 


TETTRE 
ia + Eff + “of a 


[The transition from lower to upper indices, i.e. from ‘“‘ co- 
variant” to ‘‘contravariant '’ components, is performed in 
accordance with the equations P= — f, P = hlp = 1, 2, 3).] 
The irregular term consists of the electric part 


ò d 
is (4 — vi) ++ = “(Si fu + +) 
and the magnetic part 
fs _ Ya 
Si(se — z+) + + = (Sifes + +). 
These become, on ultistying by the factor h and expressing 
the electric and magnetic field strengths © and in the usual 
units, 
€ Cm, 
(CO, Sg. 
We have already (II, § 12) calculated the regular term for a 
homogeneous magnetic field and found it to be - - (2). On adding 
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the regular and irregular terms we obtain, on neglecting the 
squares j of the potentials, 


“(§, $+ ©). 


This contains the fact, which was already derived in § 4 from 
ee , 
spectroscopic data, that to the spin 5 ©’, twice as great a magnetic 


moment is to be ascribed as to the same amount of orbital 
moment of momentum; we have now obtained a convincing 
theoretical foundation for this procedure. The laws governing 
the interaction of a general inhomogeneous magnetic field with 
orbital and spin momenta emphasize still more emphatically 
the essential difference between & and ©’. The irregular electric 
term, calculated for the central-symmetric field originating in 
the nucleus, is the spin perturbation. 

The description of the electron given earlier, according to 
which it was a composite structure composed of two kine- 
matically independent parts—the electron translation, with an 
œ -dimensional system-space, and the electron spin, with a 
2-dimensional system space—is, in view of the Dirac theory, 
no longer quite appropriate. But the classification of spectra 
given there is none the less valid here, for it depends only on 
the fact that to the group of rotations of physical space corre- 
sponds the representation D, x € in the total system-space. 

From the field equations (6.1) as they are to be understood 
for the present, i.e. as the laws of motion of an electron in an 
external electro-magnetic field, dispersion phenomena can be 
(approximately) calculated ; they tell us how the motion of the 
electron in the normal or other quantum states is affected by 
the incident light wave. From the perturbed % we then deter- 
mine the scattered light with the aid of Maxwell’s equations ; 
to this class of phenomena belong in particular the Compton 
and Smekal-Raman effects.!5 Spontaneous emission can be 
handled similarly if we take the considerations of II, § 18, as 
justifying the following procedure: The polarization and 
intensity of light emitted by the quantum jump n —> n’ of the 
atom is to be calculated by integrating Maxwell’s equations, 


where the expressions hh, eN for charge and current density 


are to be understood as yn, PSY, Y™ being the 
characteristic function of the atom in the n? quantum state. 
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ITI. Interchange of Past and Future. 


The action is so constructed.that it is invariant under inter- 
change of right and left ; the corresponding substitution is 


Xo —> Xo, > i) — 123 
lo fo. Jo>-— fr; p= 12,8) (6.12) 
p>, we p; b> ty, be > de, 


Does a corresponding result hold for the interchange of past 
and future? The foundations of the theory lead to the hope 
that it will be able to take account of the essential difference 
between the two time directions, so obvious in Nature. But 
Dirac has remarked that M, M” go over into — M, — M’ under 
the influence of the substitution 


Xa > — Xn fa > — Sa (a = 0, 1, 2, 3) ; 1 
by > Yh, be > pa; p> — ph, by > — pa 


Hence when, in dealing with the motion of an electron in an 
external electro-magnetic field, we obtain a solution y% which 
contains the time in the factor e~t, this substitution will lead 
us to anew solution which contains the time in the factor e”! ; or, 
more precisely, a solution of the problem obtained by changing 
finto — f. But this can be done by retaining the same external 
field with potentials ¢ and replacing e by — e. We denote such 
a particle, whose mass is the same as that of the electron but 
whose charge is e instead of — e, as a “ positive electron ” ; it 
is not observed in Nature! It follows from what has been said 
above that the energy levels of such a particle are — hy, where 
hv are those of the negative electron. JDisregarding this differ- 
ence in sign, the two particles behave the same. The electron 
will possess, in addition to its positive energy levels, negative ones 
as well, the latter arising from the positive energy levels of the 
positive electron on changing signs'as above. Obviously some- 
thing is wrong here ; we should be able to get rid of these negative 
energy levels of the electron. But that seems impossible, for 
under the influence of the radiation field transitions should occur 
between the positive and negative terms. That we have twice 
as many terms as we should is obviously related to the fact 
that our quantity p has four instead of two components (satisfying 
first order differential equations). The solution of this dif- 
ficulty would seem to lie in the direction of interpreting our 
four differential equations as including the proton in addition 
to the electron. 


(6.13) 
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The substitution (6.13) transforms the terms M, M’ of the 
action into — M, — M’, but leaves the Maxwellian term F 
unaltered. Our field equations as a whole, i.e. when we also 
take into account the reaction of the particle on the radiation 
field, are consequently mot invariant under this substitution. 
However, there does exist a substitution which reverses the 
direction of time and which at the same time leaves all terms in 
the action invariant. We mentioned in III, §8 that the ex- 
pression (5.13) formed from a ẹ with two components takes on 
the sign ò, : 4) = 1, 6, = — 1(p = l, 2, 3) on going over from 
by, Pato Ja — oy. Hence if w is a quantity which transforms in 
the same way as # then 


Ý Saw > 84° D Sy; 
oy 


on applying this to w = 20%, dxs we find that 


~a dy dab 
Pare, —> Òx . ep ws. 


Hence if we make in addition the substitution 
Xo > — Xo %yp>%_ (p= l, 2, 3) 
then 
3 w dY 3 dpb 
Za — Zin ot 
and consequently M, formula (5.5), remains invariant. In the 


presence of an electro-magnetic field its components must 
change signs in accordance with 


h>, f> — fs (p= l, 2, 8). 


We have thus found that M, M’ and F all remain invariant 
under the substitution 


Xo —> — Xo, Xp —> Ky; 
Jo —> fo, fo > — fo: je = l, 2, 8) | (6.14) 
hot, heh; Kok, eo —¥ 


This shows that the past and the future enter into our field 
theory in precisely the same manner—in spite of the fact that 
the sign in the exponent of the time factor e~”! of a solution of 
the quantum problem is unchanged by the substitution (6.14). 
We must of course suspend judgment as to whether the laws 
governing interaction between photons and electrons allow us 
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to distinguish between these two directions in time until we 
have carried through the quantization (§ 12). 


$ 7. Electron in Spherically Symmetric Field 


We now proceed to the discussion of the behaviour of an 
electron in a spherically symmetric electrostatic field in Dirac’s 
theory. 

I. Dirac’s Conservation Theorem. 


From the definitions follow immediately the commutation 
rules : 
S,T = —TS,, S,T=TS, (p= 1, 2, 3). 
We need further the results 
SiS, = 1, SaS; = — SS; = iSi 
and the commutation rules 
o Lipi — Pilar = 9, (P8) = pili + Pole + pals = 9, 
Li pa — Pa La = ipa, Lepi — pile = — ts, 


for the components of linear and angular momenta p = (Py, P», 
ae oe p = (Pu Pa Ps) 
In a spherically symmetric electrostatic field fı = f = fs = 9 
and fy = @ is a function only of the distance 7 from the centre. 
With the aid of the formule given above it is easily shown that 


Las 
M,=Li+ 551 


commutes with Ø, T', (G’p) and consequently with each term in 
the expression 


IH = + (Gp) + mol (7.1) 


for the energy H. Indeed, this conservation law for the total 
moment of momentum M = & + 5) was already known to 


us from general considerations. We further find that (S'R) 
commutes with ® and T, but that 


(S'L)(S'p) + (S'p)(S'R) = — ASP) 


(S'p){(SQ + 1) + (SY + 1{S'p) = 0. 
Hence (6’2) + 1 anti-commutes with (G'p) and therefore also 


with (Gp) ; its commutation properties with respect to the three 


or 
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terms of (7.1) are therefore the same as those of T. Hence on 
setting 


(SR) +1 =kT, (7.2) 


k is a scalar which commutes with the energy H (where by scalar 
we mean invariant under the group of rotations of space). 
Consequently we can decompose the system-space of the electron 
into irreducible sub-spaces R, associated with the rotation 
group, in such a way that the quantity k, which we call the 
auxiliary quantum number, as well as the energy H, 
possesses a definite value in each of the sub-spaces. Now 


(GL)? = {Li + +} + {S2S3Lals — Lala) + +} 
=U — (SL, + +) =? — (SR) 


and consequently 


2 
(SQ) + 1 = L H (S69+1= (3 + 58’) += m +1 
i 
Ne? = k? — 4 
This agrees with 
y o db? 1 
=j += (i+) i (7.8) 
when we put 
, 1 1 
j= |kl—5 k =F +5 (7.4) 


Accordingly, the auxiliary quantum number k is a non-vanishing 
integer. The conservation theorem (7.2) goes beyond (7.3) in 
giving us in addition the sign of k. For a given half-integral j 


the two values k = + ( j + 5) are both possible; they must 


correspond to the two possibilities l = j +5 of our previous 


notation. The single quantum number k replaces the two l, j. 


II. The Differential Equation for the Deiermination of the 
Characteristic Values. 


Since the field is spherically symmetric, it suffices to carry 


through the calculation for the point x = 0, y = 0, z==r. At 
this point 
_ gD Zð _ 
L = t dy’ Ly= > 5y L; = 
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and the Dirac conservation law (7.2) becomes 
Ò „ò , 
ENE 
5 
“(33 + ih = = kh — hh | 


together with the equations obtained from these by interchanging 
the two pairs p, Ya and Wi, pa of components. The differential 
equation (6.1) for the characteristic vector %, which contains 
the time only in the factor e~’”, has as its four components the 
two 


(7.5) 


[0 
Upi + i(< — is) e + a _ Mo, = 0 
, (7.6) 
; ./ò .o\,> 0 
Up, — 1 (= — is, ie — oi — Mop, = 0 


and two others of analogous structure; we have here written 
Vv 
E = T E — @ = U. 


The derivatives with respect to x and y which appear in (7.6) 
can be eliminated with the aid of (7.5); the resulting equations 


are 
[tiG +g) |i (m+ Feo 
[P -G +H) e+ (—m + = 


f= (0, 0, r), £ = (0, 0, r). 


The remaining two equations are obtained by writing (Y2, pa) 
in place of (%,, pi). At an arbitrary point P = P(x, y, z) the 
first and third components of ẹ satisfy the equations (7.7) in 
a rotated co-ordinate system whose positive z-axis passes through 
P. Weshall find it convenient to introduce rf and rg as variables 
in place of f and g, as 


eG +H) 


If we wish to avoid the explicit appearance of t in the equations, 
we may write 


(7.7) 


where 


rf =v 4 iw, rg = v — iw 
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and obtain, finally, the fundamental equations 


III. Spherical Harmonics with Spin. 


Let f(r), g(r) be a solution of equations (7.7); then in the 
rotated co-ordinate system 


pı =f. p, i =g. pi w= f.t, = ET 
where the factors p, r are constants independent of r. On 
returning to the original co-ordinate system each of the pairs 


bs, Wa; Wi, we undergoes the transformation ø associated with 
the rotation s. Consequently 


fı = fei + gtr ty = gp, + fry 
Yo = fPa + ste | Po = gpa + fre 
in which f and g depend only on r, and the factors p, r only on 


direction, i.e. on the spherical co-ordinates 6, d introduced by 
setting 


(7.9) 


x + iy =r sin Oe? z=r cos ĝ; 
the coefficients in (7.9) must further satisfy the conditions 


px(1 — cos 0) — pa sin 0 e*t? = 0, (7.10) 
7,(1 + cos 0) + 7, sin 0 e * = 0, 


On substituting the expression for & in polar co-ordinates 
[II, (4.10)] into the Dirac conservation law, we are led, with 
the aid of (7.9) and (7.10), to the differential equations 


sin girs + ie + k(1 + cos 6)7, = 0, 


kag O71 


(7.11) 

sin ew — S$ — k(1 — cos @)p, = 0. 
We have thereby accomplished the transformation of the Dirac 
wave equation into polar co-ordinates. (7.9) corresponds to the 
substitution % = f(r) Y, of the scalar theory; in place of the 
single factor f depending only on the distance r we have here the 
pair f, gand in place of the surface harmonic Y, depending 
only on the direction we have the matrix 


Pi Ti 
Pa Te 


SPHERICALLY SYMMETRIC FIELD 231 


The equations (7.11), together with the conditions (7.10), define 
the “ surface harmonics with spin of order k”; they are quite 
independent of the potential ®. The characteristic values E of 
the equations (7.7) or (7.8) are the energy levels associated with 
quantum number k, 

As in the theory of the ordinary spherical harmonics, we 
here again seek out those spherical harmonics with spin which 
contain the meridian angle only in the multiplicative factor e™ : 


py = e'™ (sin O)-™> P, ry = emt (sin Om- Q. (7.12) 


Substituting these expressions in (7.11) and taking z = cos 0 as 
the independent variable, we find 


dP 
(1 — 2) = — mP + ko, 


(7,13) 
(1+ j2 = mQ — kP. 


We denote the solutions P, Q of these equations which lead to 
non-singular functions p, r on the sphere more precisely by 

Pm), OM) It suffices to consider the case k >O, for (— P, Q) 
is a solution of the equations obtained by changing k into — k: 


P22) = — PE), Oz) = Q(z). (7.14) 


Furthermore, 


d Pin) agi n) 
== P(m-}) Sr = Q(m~}) 
mn, Q 
for the derivatives of P™), Q) atiy the differential equations 
(7.13) with m — lin place ofm. Form = — k, P=1,Q0=-—1 


is a solution which satisfies all continuity requirements on the 
sphere, since the multiplicative factor 


(sin 8) memt = (x — iy)" 
is finite for negative m. Consequently we find polynomial 
solutions of (7.13), the degrees of which are 0, 1,- », 2k — 1 
corresponding to the values m = — k, —k +1, R—1. 
The solution for m = k — 1 is 
P(z) = (1 — a1 + 2)", Q(z) = (1 + a1 — s). 


We thus finally obtain the following explicit expressions for the 
spherical harmonics with spin : 


P2) = Z ah- (1 + 2)"}, OPP) = i L+) 
(7.15) 
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where p=k—1—m. They behave very much like the 
ordinary spherical harmonics. The following equations are 
also of importance : 


PẸ®(— 2) = (— 1)? . Oz), Q(— 2) == (— 1)? . Ma) (7.16) 


§ 8. Selection Rules. Fine Structure 


I. Selection Rules. 


In a solution % defined by (7.9), (7.12) yy, like py and 7, 
contains ¢ only in the factor e'™è and Wy, like p and Ta only in 
the factor eim + 1} ; correspondingly for hi, oy. Hence 


=LA Llp ee 
M, h — i d$ + 9 p — (m | 5 ýa 
lW lpas fma] 
Mite = 733 g= (m H g Ya 
The z-component of the moment of momentum in the state 
(k, m) is accordingly m + 4 This change in the meaning of 


. l 
the quantum number m is to be carefully noted: m -+ g runs 


through the values 
l 3 lo‘, , 
k—J kpn ktaj jel n, "=J 


as it should. 

In order to obtain the selection rules for the possible transi- 
tions (k, m) > (k’, m’) and to obtain the corresponding intensities 
we must calculate the matrix which represents the energy of 
interaction between the atom and radiation in terms of the 
co-ordinate system determined by the characteristic functions 
y defining the quantum states n of the atom, Proceeding 
as in II, § 13, we see from (5.15) that this matrix is 


3 
EX Soby ' 
p=} 


The vector ec © here plays the same rôle as d there, The in- 
tensities are essentially determined by the elements O(n’). 
the three components of which are 


So(nn') = (PS, pnay. 


_The selection rules are merely consequences of the fact that 
© isa vector. We first obtain the old result for mand j from 
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considerations involving the proper rotations of space. The 
rule for 7 asserts that the auxiliary quantum number k may go 
over into 

+(k— 1), +k, +(k +1). (8.1) 
To the reflection t corresponds the interchange T of the two 


pairs (1, Wa, (Yi, Y2). In polar co-ordinates this reflection 
consists in the transition from (0, $) to (m — 0,7 + 4); z = cos@ 


is thereby transformed into — z and the factor e* takes on the 

sign (— 1)”. In accordance with (7.15) and the expressions 

for py, Ta; Po Te this results in an interchange of p,, 7, with 

possible change of sign, as represented by the substitution 

0 1 O 1 
0 1 0 


and the same for pe, Tz. By (7.9) we therefore have for %& with 
auxiliary quantum number k: 


T4(— xX, — Y, = 2) = (— 1)*"hb(x, Y, 2). 
The sub-space Ry thus has the signature 6 = (— 1)®71; this 


result was derived under the assumption k œ> 0. On replacing 
k by — k and applying (7.14) we find in place of (7.16) : 


POY — 2) = (— 1P POM (2), QT a) = (— IPEPE). 


The signature corresponding to auxiliary quantum number 
— k (k > 0) is accordingly (— 1)*. On setting 


(— 1)?*™ =(— 1) 


? 


l = — k when k is negative (j=-k-5=1-5), 
1 ni 82 
l = k — 1 when k is positive (j=k-5=!+5), 


both possibilities are included under ô = (— 1)’, or we could 
also write 8 = sen k- (— 1)*"}. The only coefficients occurring 
in a proper vector are those corresponding to transitions in 
which the signature is reversed. Our selection rule (8.1) for 
k is thus narrowed down to 


k > k—l,—k k+l. (8.3) 


The following table gives the value of the auxiliary quantum 
number k associated with each possible combination of / andj: 


~~ |0 1 2 3 č 4t 


ananena eea oraaa 


j=1-5 —] —2 —3 —4- 


jsib 2 8 4 Be 
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Il. Transition to the limit c—> œ. 


In order to return from relativistic to ordinary mechanics 
we must pass to the limit c- œ. Before applying this to 


equations (7.8) we must replace U, v by my + =1 eV; we then 


. U. . am 
have, on neglecting “a in comparison with PE 


wom (fet 
2mw = — a(S —*\n 


on eliminating w we obtain 


=- BEL Me 


had k(k~-1 
aa a= Nu + Uv = Q. 


On introducing | by (8.2) we have in both cases k(k — 1)=/(/-+1). 
Hence in the limit terms with the same /, and therefore those 
with auxiliary quantum numbers k and — k — 1, coincide with 
that one associated with azimuthal quantum number / in the 
scalar theory of Chapter II. The doublet found in alkali spectra 
—and in general the multiplet structure of spectral lines—ts 
accordingly explained as a relativistic phenomenon. 


HMI. H, Het,- >> 
In a Coulomb field with nuclear charge Ze we have 
Za 
-9 = Ta’ 

employing Heaviside units, which are better adapted to a field 
theory. In the following calculations we shall denote the 
a 
An 


and we shall set mjc = v, In order to integrate equations 
(7.8) we first perform the substitution 


v=. F, woe r-G, 
where 8 is a positive constant. Our equations are then 
d isk GB (Vw 
(a+ 5S GPa (Z—F)F + eG 


(g-Hr+tonsr—C+2)¢ 


dr r 


or 


multiple ae of the fine-structure constant — simply by « itself, 


(8.4) 
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Our method will lead to a solution if we choose the constant B 
in such a way that the determinant of the linear combinations 
of F and G on the right vanishes : 


v\? Vo\ ? 
(a) ~ (2) + R= 0, cB = Vu? — (8.5) 
We now seek a power series solution 


F = Sar, G= xb, r”, 
where the exponent » begins with an initial value Ho and runs 


through the values Ho, Mo + 1, p +2, >+ On substituting 
these in (8.4) we obtain the recursion formule 


(u + k)d, — a au = ( — Napa + B bui 
ab, + (u — k)a, = B api — (-+ P) by 


The initial exponent p = py is determined by the fact that the 
determinant of the coefficients of a,, b, on the left must vanish 
for this value of the index : 


p — kè p= 0; m = Vk? — a2, 
Because of the manner in which B was determined in (8.5) there 


(8.6) 


. * * . . Vv V 
exists a linear relation, with coefficients — + 2, 8 between the 
c ct 


m-l” 


right-hand sides of (8.6) which is satisfied identically in a,_,, b 
Hence for all p 


¢ + =) [(@ + k)bu — xau] + Bla bu + (u — kja] = 0 
r 


oO 


v v l v v 
b| (E+) u+ + aB] + al Bu = w) —(2+)a]=0. 
(8.7) 
The power series will break off with the term with exponent p 
if on replacing ap-u, bu, by a,, bu the right-hand side of (8.6) is 
made to vanish. The condition for this is that 
Vv Vo — ; 
Bb, + C- a, = 0; (8.8) 


C 


it will be satisfied in virtue of (8.7) if the determinant of the 
coefficients in these two equations vanishes : 


(F-2)[ (E+ 2)G+e) tae] eu-n- Ea] 
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or by (8.5) 


Since the exponent u with which the series break off must be 
of the form p + n, where n is a positive integer, we obtain the 
fine structure formula 


v E 1 ——— 
VAc Vm — E a(n + Vk? — at). (8.9) 


The solution % of our differential equations, for the char- 
acteristic values v = cE defined by (8.9), is of the form 


e~ Ar. pko. (polynomial of degree n in r) 


and satisfies the condition that the spatial integral of |x|? con- 
verge in the neighbourhood of the singular points r = 0, œ. 
These E consequently constitute the discrete term spectrum of an 
ion with nuclear charge Ze and having but one electron outside 
the nucleus. If we neglect the small constant a in comparison 
with k, E depends only on n + |k|. This fine structure formula 
further tells us that the two terms with auxiliary quantum 
numbers k and — k, or the two terms with the same 7 and for 


which /=7 + ‘exactly coincide. That this is in fact found 


to be the case has already been mentioned in § 4. Equation 
(8.9) has had a remarkable history. It was first derived on the 
basis of the older quantum theory by Sommerfeld and, at about 
the same time, verified by the experiments of Paschen ; it was 
perhaps the greatest triumph of that theory, next to Bohr's 
explanation of the Balmer series and his calculation of the 
Rydberg number from universal atomic constants. The new 
quantum theory at first destroyed this beautiful agreement, 
as in its scalar form it led to (8.9) with the half-integral quantum 
number j in place of the integral |k|. Sommerfeld’s original 
formula was only completely re-established with the advent 
of the Dirac theory here discussed. The quantum number k, 
which was used in the older quantum mechanics in place of l 
and which may assume the value 0, has also re-appeared and 
is now supplied with a sign. But on the other hand, the number 
of components in the fine structure is now greater than in 
Sommerfeld’s theory, as in addition to the transitions k>k—], 
k + 1 we may now also have k > — k; this addition is also in 
agreement with experiment. 
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Our conclusion that (8.8) was to be satisfied in virtue of 
equation (8.7) for the unknowns a,, ba, assuming that the deter- 
minant of the two equations vanished, fails when both coefficients 
of equation (8.6) are zero: 


vty _« pk 
cB u+k a 
It follows from this that then u = Vk? — &?, or n = 0, and that 
p+k<0, or k<0. There actually exist no terms n = 0, 
k = — l,— 2,» +». For the coefficients a,, b, of the beginning 
term in the corresponding solution, which is at the same time 
the end term, would by (8.6), (8.8) necessarily satisfy the equations 


(u + k)bu — 2a, = 0, aby + (u —k)a,= 0, Bb, + “a, = 0 
or 


TT ernment A mitment «AAA 


and this is impossible because of the condition |r| < v9.18 

In accordance with the foregoing we may describe the normal 
state of the hydrogen atom; n=0, k=1 (L= 0), as follows. 
We take the quantum number m, which may assume either of 
the values 0, — 1, to be 0. Let a = 0-532 A. be the radius of 
the first Bohr orbit and « = 7-29: 10-3 the fine-structure con- 
stant. yy, Pz; Wr, Pr are obtained by multiplying the radial 
function 

Ar) = ewe. pyi =- 


with the factors 


(1+ V1— a) +iacos 6, iasin bet] dy pa 
(1+ V1 — a?) —iacos 6, — ia sin 0 eiè | pi oe. 


We find from these expressions that the probability density  y 
is distributed spherical-symmetrically in accordance with the 
law 

p = Ain). 


The normalization is here not chosen in such a way that the 
integral of p over all space is unity ; it is actually 


A\1+ 2/1 — a yi 
4n(35) ra + 2V1— à). 


We have already seen that in a certain sense the probability 
density multiplied by — e represents the distribution of charge 
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in the atom. Considering the probability current as deter- 
mining the convection of this continuous charge distribution p, 
we find that it represents a circulation about the z-axis with 
velocity «c sin @ (ac is the velocity of the electron in the first 
Bohr orbit on the older theory). On giving the axis of rotation 
all possible directions % runs through the 2-parameter family 
of characteristic solutions for which n= 0, R=1; we may 
take as a basis for this family of solutions the above (m = 0) 
and that for which m = — 1, representing a circulation in 
the opposite direction. 


C. THE PERMUTATION GROUP 


§ 9, Resonance between Equivalent Individuals 


The Hermitian forms Q, which represent in system-space all 
possible physical quantities of a given system, constitute a 
totality 2 within which addition and multiplication is defined. 
If & were reducible we could choose our co-ordinate system in 
system-space in such a way that all Q would be simultaneously 
completely reduced; these individual parts into which the whole 
would be divisible would then each constitute solutions of the 
quantum problem which were merely accidentally joined to- 
gether to form the given solution. In accordance with the 
fundamental Aristotelian postulate of “nihil frustra’’ Nature 
could hardly be expected to indulge in such a superfluous luxury. 
Hence we propose the thesis that Æ is an irreducible system. On 
introducing as fundamental quantities the canonical variables 
as in II, § 11, this assumption contains the requirement that it be 
impossible to choose co-ordinates in system-space in such a way 
that the 2f matrices qu, © * *, dy; Pu * * *, Py are simultaneously 
completely reduced. This postulate 1s to be added to the Heisenberg 
commutation rules as an essential supplement. 

In accordance with Burnside’s theorem [III, § 10], which 
we carry over without scruple from spaces with a finite number 
of dimensions to those with infinitely many, the irreducibility 
postulate allows us to assert that there can exist no linear 
homogeneous relation tr( 40) = 0 between the components of 
Q which is satisfied for all Q. Since in the domain of the Q's 
not only is multiplication possible—as presupposed in Burnside’s 
theorem—but also addition, we arrive at the conclusion that all 
Hermitian matrices in system-space are contained in 2. It is 
perhaps desirable to express our requirement directly in the 
form: any Hermitian form represents a physical quantity of 
the system. In accordance with II, § 7 there is associated with 
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each statistical ensemble a positive definite Hermitian form A 


in such a way that tr(AQ) is the expectation of the quantity 
represented by Q. Burnside’s theorem asserts that the equation 


tr (AQ) = tr (A’Q) 


can be satisfied for all Q only if A = 4’, or it is impossible to 
distinguish between the two statistical aggregates represented by 
the positive definite Hermitian forms only if A= A’. Inparticular 
it follows from this that the states represented by two rays in 
system space are physically different if the two rays are distinct ; 
this was to be expected, or even required, from the outset. 
These consequences show the naturalness and cogency of the 
irreducibility postulate, from which it can conversely be deduced. 

The states of physical entities I which are fully equivalent, as, 
for example, the electrons in an atom, are to be represented by 
vectors g = (%,) or rays in the same system-space R. If two 
such individuals unite to form a single physical system J? the 
vectors of the corresponding system-space R x R= R? are, 
in accordance with the general rule of Xx-multiplication, the 
tensors (%;,) of order two. But, by III, § 5, R2 is reducible into 
two independent sub-spaces {R?} and [R2], the space of anti- 
symmetric and the space of symmetric tensors of 2nd order. 
Physical quantities Q of I? have only an objective physical 
significance if they depend symmetrically on the two individuals. 


This requirement is expressed in terms of the elements of the- 


Hermitian form 
Q= Žqix, ik Vik Soy 
by the symmetry condition 


Thi, ei? = Qik, ik’ (9.1) 

On reducing (%,,) into its anti-symmetric and its symmetric 
parts, 

Xir = x{ik} + x(ik) (9.2) 
Q is reduced, in virtue of (9.1), into two Hermitian forms in 
x{ik} and x(ik) respectively. For on substituting (9.2) into Q 
we obtain four terms: those in which {R*}, [9] intersect them- 
selves, and the two in which {R?} intersects [R?] or conversely. 
These last two then vanish, for if we interchange the dummy 
indices i with k, 1’ with k’ in 

[Q] = Zain, vx Hak} x(i'k') 

and then replace 


dki, k'i” x{ki}, x(k't’) by dik, ik? — x{ik}, x(t'k’) 
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we find [Q] = — [Q], or [Q] = 0. The totality of Hermitian 
forms Q which represent the quantities of J? depending sym- 
metrically on the two individuals is therefore not irreducible ; it 
can be reduced in accordance with the decomposition 


R? = {M7} + [KH] (9.3) 
of the space R?. 

In particular, every possible interaction between the two 
individuals depends symmetrically on them, even when other 
physical elements, such as a radiation field, are also involved. 
Hence if J? is at any time in a state contained in one of the 
sub-spaces {Rt} or [R?] it is for all time impossible to get it out 
of this sub-space by any influence whatsoever. Again, we expect 
Nature to make use of but one of these sub-spaces, but the 
irreducibility postulate offers us no clue as to which one she 
has decided on. 

Take as co-ordinates in the system space R of the individual 
I the principal axes e; of the energy associated with the char- 
acteristic numbers Æ; Disregarding the interaction between 
the two individuals for the moment, the system 7? has as energy 
levels E; + E, with characteristic vectors €; X €p = êi; each 
characteristic number of the type E, + FE, appears twice, and 
the corresponding characteristic space is spanned by the vectors 
êi and êa. On introducing the interaction as a small per- 
turbation the two states ¢,, and ĉa are in resonance with each 
other. Denoting the components of the total Hamiltonian 
function by H(ik, i’k’), the transformation of the sub-matrix 


H(1 2,12) H(12, 21) 
H(2 1, 12) A(21, 21) 
to principal axes, as required by perturbation theory, can in 


the present case be performed in a manner which is universally 
valid; we need only to replace the fundamental vectors e,9, ĉa; by 


alta — êz), Salta + ĉn). (9.4) 


Denoting H(1 2, 1 2) = H(2 1, 2 1) by hv and the numbers 
A(1 2, 2 1) = H(2 1, 1 2), which must be real in virtue of the 
condition H(1 2, 2 1) = H(2 1, 1 2) of Hermitian symmetry, by 
ha, the resonance equations become 

1 dx 

7 P + (vti + %%1) = 0, 

l dx 

ara (a Xia + VX) = 0 
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from which it follows that 


Cres = Za) = — t(v — a) (X49 — Xa), 


Aas E fa) = — ifv + a)lxi2 + Xa). 


Taking as initial conditions x,, = 1, %., = 0 for t= 0, we find 


Xy — Xag = OOM yy F Xy = THOT DE (9.5) 
| Xa |? = cos? at, | £o |? = sin? at. 


We see from this how the two states êja ¢,, alternate back and 
. ., 2 
forth with the beat period < , Whereas the components (9.5) 


along the axes (9.4) have always the same constant absolute 
magnitudes. 

The only characteristic numbers associated with the system 
space {R?} are those of the type E, + E, each of which appears 
exactly once, but the sub-space [R?] has simple characteristic 
numbers of the type 2E, in addition to these. Hence if Nature 
decides in favour of {R?} both individuals can never be sim- 
ultaneously in the same quantum state with energy E,—assum- 
ing this energy level for the individual system is non-degenerate. 
That E, + E, occurs only once in {R?} and only once in [R?] 
means: the possibility that one of the identical twins Mike 
and Ike is in the quantum state E, and the other in the quantum 
state E does not include two differentiable cases which are 
permuted on permuting Mike and Ike; it is impossible for 
either of these individuals to retain his identity so that one of 
them will always be able to say “ Pm Mike” and the other 
“Tm Ike” Even in principle one cannot demand an alibi 
of an electron! In this way the Leibnizian principle of coin- 
cidentia indiscernibilium holds in quantum mechanics.}’ 

On passing from 2 to f equivalent individuals J it is not so 
easy to reduce the representation (c)f of the complete linear or of 
the unitary group in system-space into its irreducible con- 
stituents; we shall go into this matter in the last chapter. 
Nevertheless we know from III, § 5, that the anti-symmetric 
and the symmetric tensors of order f with components 


{RyRy “es ky}, x(RyRe so ky), 


respectively, each yield such an irreducible representation. 
A physical quantity Q of the total system Jf which depends 
symmetrically on all f individuals will be represented by an 
Hermitian operator Q, the coefficients q(k,k,-+-* ky; Rike~* + Ry) 
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of which are unchanged on subjecting kika’ '* ky and kik, >> + kr 
simultaneously to the same permutation. It is evident that such 
an operator always sends an anti-symmetric tensor x{k,k,-:- ky} 
into an anti-symmetric tensor %’ ; 


x' {kikat Ry} = Zqlkika + * * kz; kika © > > k)æfkiky a Ry}- 


Hence the sub-space {R} of anti-symmetric tensors is reduced 
out of the system-space Ñ’ of If, determined in accordance with 
the general rule of x-multiplication, in such a way that if Z7 
is ever in the system space {i} it remains there forever, regard- 
less of what influences may act upon it. The sub-space [R] 
of all symmetric tensors x(k) of order f can similarly be separated 
out of R. The energy level E, + Fy, +: ‘+--+ E, which is 
J!-fold degenerate in R, appears in {R} as a simple level. Only 
characteristic numbers of this type appear in {R}, but the 
characteristic numbers of [R] are all numbers which can be 
obtained by summation of f distinct or non-distinct energics Æ. 

If the system space is n-dimensional, {R} is only possible 
f f Sn. IfE is an n-fold energy level of the individual I then 
the quantum states with energy E constitute an n-dimensional 
sub-space R(E). If it should happen that only {N} is realized 
in Nature, then in view of the foregoing it would be impossible 
to have more than n individuals of the system If in the quantum 
State E. 

The reduction of R to {R} or [Rf] involves relationships 
which frustrate any attempt at description in terms of our 
old intuitive pictures with their orbits and billiard-bal] electrons. 
But the difficulty enters already with the general composition 
rule, according to which the manifold of possible pure states 
of a system composed of two parts is much greater than the 
manifold of combinations in which each of the partial systems 
is itself in a pure state. 


§ 10. The Pauli Exclusion Principle and the Structure 
of the Periodic Table 


One of the most fundamental facts of N ature, the ordering of 
the chemical elements in the periodic table, can be understood 
only with the help of these considerations. We go from one 
atom to the following, which we denote by A, in two steps : 
the first is preparatory and consists in increasing the charge 
on the nucleus by 1, and the second and final step consists in 
adding an electron to the ion At so obtained. To obtain the 
normal state of A this additional electron must be bound as 
tightly as possible, i.e. the energy of the total system A must be 
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a minimum. If we disregard the mutual perturbations of the 
electrons for a moment, although they may be very considerable 
we might expect to find every electron in an unexcited atom in 
the lowest energy level, i.e. with principal quantum number n = 1. 
But instead we find the following: The 1 electron of H and the 
2 electrons of He are in the 1s orbit, i.e. they are in the quantum 
state n = 1, l = 0. But the next 2 electrons, which are added 
in going over to Li, Be, are in a 2s orbit, and the additional 6. the 
addition of each of which gives rise to one of the elements from 
B to Ne, enter the 2p orbit. Then follow Na, Mg, each with a 
new electron in the 3s state, the elements from Al to A. the 
additional electrons entering the 3p orbit, etc. These facts 
are readily seen on writing the wave number of the lowest 
S term in the form — R/n®,; in H, He, Li the “ effective 
quantum number” , has the values 1'00. 0:74, 1:59. That 
n, sinks on going from H to He is understandable in view of 
the ‘screening ” effect of the original electron on the new one. 
We should expect that if the next electron also went into the 
orbit n = l the corresponding value of n, would be something 
like 0:59, but we find instead a number which is greater than 
this by unity. The same occurs on going from Be to B or from 
Mg to Al; the normal states of these atoms are formed by the 
valence electrons entering 2p or 3p orbits because the 2s or 3s 
orbits are already “ occupied,” and if the valence electron is 
raised to an s state by excitation, it can only be raised to one 
for which n 23 or n 2 4.* Obviously the essential features 
of the regularities expressed in the periodic table depend on this 
mysterious numerus clausus for the various states with principal 
quantum numbers 7n = 1, 2,--- and on the fact that in conse- 
quence of this the electrons in the atom are added on in definite 
layers or ‘‘shells.’’ Stated more precisely, in an ms orbit 
(n = 1, 2,---) there is room for but 2 electrons, in an mp orbit 
(n = 2, 3,-+-) for but 6; in general the situation is described 
by Stoner’s rule: there can be at most 2(21-+- 1) electrons in a 
state with quantum numbers n, l. 

On taking into account the duplicity caused by the spin we 
see that this number is exactly the dimensionality of the sub- 
space R(n?) in the system-space of a single electron. Neglecting 
the spin perturbation, which is indeed much smaller than the 


* The physical significance of the “ true principal quantum number ” 
n is contained in these considerations : we think of the term in the Hamiltonian 
function which represents the energy of interaction between the various 
electrons as multiplied by a numerical factor A and let A decrease steadily 
from 1 to 0; this virtual adiabatic process sends each electron into a definite 
hydrogenic orbit with a principal quantum number n, the “ true quantum 
number ” of the electron. 
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mutual perturbations of the electrons, the energy level associ- 
ated with this sub-space is 2(27 + 1)-fold degenerate. This 
degeneracy can be removed by the introduction of the spin 
perturbation and a weak magnetic field; the energy level is 
then broken up into 2(2/ + 1) simple components distinguished 
by the quantum numbers 


, l . . 
J=l£ğ m =],]— l, “7%, — J. 


Stoner’s rule led Paul: to postulate the exclusion of equivalent 
orbits: it is impossible for two electrons in an atom to be simul- 
taneously in the same quantum state (n, l, j, m). This shows 
that Rf is obviously not the system space of the physical system 
If in which f electrons revolve about a fixed nucleus, but that 
the reduction to {R} takes place: Nature has decided in favour 
of the reduction to the space of anti-symmetric tensors, at least in 
the case of electrons. In view of the considerations advanced in 
the previous paragraph this principle leads conversely to Stoner’s 
rule.18 

If the formation of one atom from the preceding one were 
an entirely regular process the occupation of the various states 
would take place in accordance with the following table, the 
lower row of which indicates the number of electrons captured, 
on going from atom to atom, by the orbit immediately above: 


ls; 2s, 2p; 3s, 3p, 3d; 4s, 4p, 4d, 4f; °° 
2; 2+6; 2+6-+10; 24+64104+14:-++ 


This would indeed be the case if we could increase the charge on 
the nuclei by some large fixed amount, for the mutual perturba- 
tions of the electrons could thus be made arbitrarily small in 
comparison with the Coulomb attraction of the nucleus. But 
even a rough calculation shows that these perturbations are 
actually too considerable not to lead to displacements in the 
above table, i.e. to changes in the order in which the various 
shells are filled. For example, after the 3p shell is filled, which 
is accomplished with A, the next 2 electrons go into 4s states 
to form K, Ca, and only then do we find electrons entering the 
3d orbits to form Sc, Ti, -- +. For details consult the books 
by Hund, Pauling and Goudsmit or Ruark and Urey mentioned 
in the Introduction. 

It is not the purpose of this book to repért on the extensive 
empirical data of spectroscopy, nor to show how the two main 
principles required to lead beyond the general scheme of quantum 


THE PAULI EXCLUSION PRINCIPLE . 245 


mechanics to the interpretation of spectra were wrested from 
this material; I here refer to the introduction of the inner 
quantum number 7 in addition to the azimuthal Z, or the spinning 
electron, on the one hand, and to the reduction of R to {R} 
by means of the Pauli exclusion principle on the other. 
Millikan begins his report to the American Philosophical Society 
on “ Recent Developments in Spectroscopy ° [Proc. Am. Phil. 
Soc. 66, p. 211 (1927)], with the words: ‘‘ Never in the history 
of science has a subject sprung so suddenly from a state of com- 
plete obscurity and unintelligibility to a condition of full illu- 
mination and predictability as has the field of spectroscopy 
since the year 1913.” The theory of groups offers the ap- 
propriate mathematical tool for the description of the order 
thus won. 

The lines of the optical spectrum are caused by quantum 
jumps of the electrons which are most loosely bound. In the 
alkalies Li, Na, K, ++ - the one involved is accordingly in the 
state 2s, 3s, 4s, +++. We also understand why their cores 
Lit, Nat, Kt, + +--+ are spherically symmetric, and therefore 
why their spectra may be approximately calculated in terms 
of the motion of an electron in a spherically symmetric field ; 
the real reason behind this is the following. That an electron 
has the quantum numbers n, l means that its state is in a 
sub-space R, of A= 2(21+ 1) dimensions. The sub-space 
{Ri X Rı X - +--+ X Rı} with A factors, as obtained by the anti- 
symmetric reduction of Rj, is l-dimensional and the rotation 
group induces in it the 1-dimensional identical representation ; 
i.e. a Shell consisting of À electrons in the state n, l acts spherical- 
symmetrically ; its presence does not increase the manifold of 
terms. Hence the “ closedness °” of those elements with which 
a shell is completed; the rare gases, which precede the alkalies, 
are elements of this kind. But we should also expect Cu, Ag, Au 
to have alkali-like spectra, as they contain but a single electron 
in the s state, while all the others are bound more tightly in 
a ‘‘ closed’ configuration with an external field which is spheri- 
cally symmetric. The valence of the elements must obviously 
find its explanation in these terms; indeed, it gave the clues 
which originally led to the discovery of the periodic table. 
But only in recent times have we been able to call on the assist- 
ance of spectra, interpreted and arranged with the aid of atomic 
theory by Bohr and others, and they have verified the principal 
features of the table, while modifying, supplementing and 
improving its details. 

The consequences of the Pauli principle for the term analysis 
of atomic spectra will be discussed in detail in Chapter V, 
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particularly in $15. We here mention briefly the results for 
the case of 2-electron spectra f = 2. . 

Just as the alkalies may be treated as if they were but 
1-electron atoms, in dealing with the alkaline earth metals we 
need only take into account the two most loosely bound electrons 
which occupy an s orbit outside a spherically symmetric closed 
shell. As before, we obtain one singlet and one triplet term 


(nl, n'l; L) 
whose total azimuthal quantum number L assumes the values 
Lele bet —1,:., [i-r | 


assuming that the two quantum states (ul), (n'l) of the individual 
electrons are distinct. The only difference is that now such 
a term appears only once, whereas before it appeared twice, 
corresponding to a permutation of the electrons. The situation 
is, however, more complicated if (nl) = (n'l). The only singlet 
terms 

(nl; nl; L) 


which actually occur are those with even L = 0, 2, - » +, 2l and the 
only triplet terms are those with odd L = 1, 3,--++-, 2l — 1. This 
rule is thoroughly in accord with the empirical data. 

The best-known lines of the spectra are those arising from 
transitions in which only one electron is not in the normal state 
and is jumping between higher energy levels. Hence if one 
of the two electrons (not saying which !) is in the normal state 
n’ = Mm, l' = 0 (m= 1, 2, 3, 4, |+» for He, Be, Mg, Ca, - - -) 
we have L= l and the two quantum numbers (n, J) suffice to 
determine the singlets or triplets. The lowest S term (L = 0) 
of the singlet system has the principal quantum number 7n = 7, 
but there is no such term in the triplet system; it begins with 
n= n +1. We find that the lowest S term in such a triplet 
system (which is, as we know, simple), e.g. in the spectrum of 
Mg, actually does lie in the neighbourhood of the second lowest 
S term of the singlet system instead of the lowest. 


§ 11. The Problem of Several Bodies and the Quantiza- 
tion of the Wave Equation 


In this paragraph we depart from our usual terminology 
and denote the number of individuals by n instead of f. We 
first consider more fully the reduction of R” to [R"], for we shall 
find that although it does not apply to electrons, it does to 
photons. Let H = ||H,,|| be the Hamiltonian function of an 
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individual. The variables P(n,, ng, - + +) of the unitary space 
[R"] behave like the monomials 


gm x”? e a ' 
=a (nı + Mg++ + + =n), (11.1) 


Vn! ngl => 


of degree n which are formed from the components x, of an 
arbitrary vector in R; we denote this monomial (11.1), without 
the denominator, by (nı, na ©). We shall have occasion 
to use the differentiation formula 


dhat xg + + +) = (my yt ge e e o day) + O et eTl e o dx) Hits. 


In the absence of interaction between the individuals we obtain 
from 


lda SH ax, =0 (11.2) 
2 at B 


the equation 
] . 
_ i P(n, Me, ° * -) = mA Higol — 1, Ne, * * *, Ag + L, C *) 
+ na Z Hop Pls, na — 1, +++, ngil,» 
-}- n . a . . . . . . . . e . . 


In the sum on the right ¢d(m, — 1, na + ++, mg +1, °° -) is to 
be interpreted as (m,, na >- *) for B= 1; similarly for the 
term with $ = 2, etc. We can also write this equation 


l. 
a 7 $l, Nagot *) = ENa H ux © Elna, a * * -) 
+ Zna Hap” QC ° ., na — I, sons ng + L, . +), 


On introducing the binomial coefficients in accordance with 
(11.1) we obtain as the equations of motion 


airy My") yn, Haa (ity, na ©) 
1 di P 


HIV nn t DA ag: plt 8) Aal, tt, aptly ++). (11.3) 
wap B 


These equations are of the form 


lay Hb = 0, H= Sep nes (11.4) 
a at a, B 
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where the matrices neg are defined by 


-) z {o" if ny = Ny, Ng =No, a.a (11.5) 


, ? 
Naalt, Na, ; Mi Ne, = 10 otherwise 


and for « + f 
Tm [V rata + 1) ay 


r È, 
Nagli Taree ; ny, No, 


where the first alternative holds when all n’ = n with the ex- 
ception of na = ną — l, ng=mg+1 and the second in 
all other cases. H is, as it should be, an Hermitian matrix, 
If H is in diagonal form the fundamental vectors forming our 
co-ordinate system are the quantum states of the various in- 
dividuals; |y(#, na - > *)|? is then the probability that there 
are simultaneously n, individuals in the first quantum state, 
Nn in the second, etc. On reduction from R” to [N7] it becomes 
impossible to identify the individuals as Mike, Ike, - - - and we 
therefore may not ask for the probability that Mike is in the 
at state, Ike is in the B™,---. If we have in addition to H a 
perturbation €W affecting the individuals (and symmetric with 
respect to these individuals), then equation (11.3) governs the 
change of the probabilities |(m,, na - - -)|? in time. 

The Hamiltonian function H reminds us of the one which we 
obtained in Chapter II, § 13 by quantizing Maxwell's equations ; 
there the individuals were photons. Maxwells equations are 
to be considered as the quantum-theoretical wave equations of 
an individual photon. If we replace the photon by an individual 
whose state (x) varies in accordance with equation (11.2) we 
are led to a new way of treating the problem of several bodies, 
which we call the “ method of second quantization ” in contrast 
to the “method of composition”’ or ‘‘ Xx -multiplication ” de- 
veloped in Chapter II, §10. In this we consider (11.2) as the 
classical equations of motion of a physical system whose canonical 
variables are the real and imaginary parts gx, Pa of Xa, and as 
such subject them to the process of quantization.12 We here 
tie on to the development given in Chapter II, §11. Introduce 
the complex quantities 


1 : Ss, i p 
Xa = Va + ipa) Ëa = RACE ipx) 


into the Hamiltonian function H as independent variables in 
place of gx, Pa; the Hamiltonian equations are then 


da dH dé H 
di *2z,’ d "3s, (11.6) 
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‘In order that (11.2) may be considered as the classical equations 
of motion of a system with infinitely many degrees of freedom, 
in accordance with our programme, they must be of the form 
(11.6). But this is in fact the case; the Hamiltonian function 
is then 
H = Z H aglap. 

In quantizing %,, «e are to be replaced by Hermitian conjugate 
matrices X,, X, which satisfy the following commutation rules: 

Xa Xp — Xp Xq == 0, Xa Xp _ Xp Xx = 0, 
0 (x + ). 
The Hamiltonian function H then becomes the matrix 

H = Xap Xa XB; (11.8) 
Xx, 


Xag — Xp Xa = Baa =f 


if H is in diagonal form then 
H = YE. X.Xx- 


We are here dealing with an infinite set of oscillators, the in- 
dividual members of which are distinguished by the index «; 
the energy of the «" is given in terms of the complex co-ordinates 
Xn, Ča by Ly Ly Xa 

The quantum theory of a single oscillator as developed in 
II, § 3 gives us as the irreducible solution of 


XX — Xx = 1, 


where x, X are two Hermitian conjugate matrices normalized 
in such a way that the energy Xx is in diagonal form, the matrices 


x(n, n + 1) = Vn + l, (n,n — 1) = Vn; žx(n, n) = n, 


all other components vanishing ; the quantum number n assumes 
the values 0, 1, 2, +. From this we obtain the solution of 
(11.7) by composition : 


[v TI if all n’ = n 


| , 

X(N, Ma," t My, Me, * * ‘) = except Na = fna + I, 
0 otherwise ; 
Va if al w =n 

pr e. ? yoa’ Ny 

X(N, No, > My, ng'e e) = except ny = Na — 1, 
0 otherwise. 


The products Xq,Xq are of course in diagonal form; X,X, is the 
matrix Nag introduced above, and (11.8) coincides with (11.4) : 
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the method of second quantization leads to the same result as the 
method of composition supplemented by the * symmeiric reduction 
of R” to [R"]. But now the number 


m+m+' =n 


of individuals is not prescribed; however H is reduced into 
sub-matrices in accordance with the various values of n, for 
all components H(nyng + °°; NN, + + +) for which nN, + n + 
c++ stn, tn,+°-+ + vanish. The total number of photons 
is not conserved, and to this extent Maxwell’s equations do not 
fit completely into the quantum-theoretical picture—unless we 
wish to consider “ non-existence ° as a particular quantum 
state of the photon. l 

The method of composition remains applicable in the presence 
of interaction between the individuals, provided it is an in- 
stantaneous action at a distance determined by the simultaneous 
values of the canonical variables of the various individuals. 
But it breaks down when, as in the theory of relativity, account 
is taken of the finite velocity of propagation, which led to the 
introduction of continuous fields in the classical theories. The 
difficulty arises from the fact that the wave function % must 
contain the one time ¢ as argument in addition to the spatial 
co-ordinates of each particle, whereas the theory of relativity 
requires that the proper tıme of each particle appear as argu- 
ment in as well as the spatial co-ordinates. The method of 
second quantızation shows its superiority in dealing with such 
problems. 

As we have seen, the method of second quantization in 
accordance with Heisenberg’s commutation rules is equivalent 
to a reduction of the system space R” to [R"]. Since we have 
seen in II, § 13 that this leads to the correct laws of radiation 
phenomena, we must conclude that the behaviour of photons 
corresponds to this reduction. But in the case of electrons the 
reduction is to the space {R"}, and we must now investigate 
to what kind of quantization this corresponds.2° The vectors 


of the unitary space {§t"} are the anti-symmetric tensors with 
components 


MO, Aa * +, An} ~ [Xas Kary t, Xal (11.9) 


in the space R, where the one row in the determinant stands for 
the n rows formed in the same manner from 7 vectors r= r, 
L, - + + 2) of R. We can obtain the totality of linearly 


independent components by restricting the indices by the 
condition 


AL ag Le <u, (11.10) 
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We now denote (11.9) by (nm, na - - +), where n = 1 or 0 
according to whether « appears in the set of indices «1, a, °°, 
a, or not; these quantum numbers ną may thus only assume 
one of two values. On replacing a, = « in (11.9) by an index 
B + x, (11.9) vanishes if B is equal to one of the remaining 
indices %,°*°, &,; if Bis different from az, ***, «, it becomes 


%{ Bots mr on} = E (ny, my Ne — l, my ng + 1, C ‘), 
the sign + 1 being (— 1)’ where r is the number of indices in 
the set æa ***, €n lying between « and £: 

r= Jn, 
i 


where the sum is extended over all indices A between « and 8. 
We again obtain equations of the form (11.4); (11.5) is then 
valid as it stands but (11.5’) is to be replaced by 


Nap (1, Me, * 8 8 ny, No, n -) = + l or 0, 
where the first alternative applies to the case in which all n’ =n 
except n, = l, ny = 0; n,=0, n,= 1, the sign being again 
determined in accordance with the above rule. On writing 
a matrix |ja(nn’)|| in the form 


a(0 0) a(0 1) | 
a(1 0) all 1) 


and introducing the abbreviations 


1 0 1 0 
[o  |=4 0 f=" 
we may write 
00 
Naw =1XKIX +s: x4 [xd x1 uty 
01 


00 
Nap= 1X1 rn x! 0 


xix A" [5 9 ax <- (x8), 


where the matrix that is written explicitly in the first equation 
is in the «tt place and those in the second in the «'® and p® 
places respectively. We must now attempt to write these 
matrices in the form X,X,; this can in fact be accomplished by 
taking 


maa xa xe sex ax Sool xixi] 
0 0 (11.11) 
Batt xe xatx| o xax tx : 
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the small explicit matrices being in the « place. x,, ¥, are 
Hermitian conjugates, and H can now be written in terms of 
them in the desired form (11.8). Instead of the commutation 
rules (11.7) we now have 


Xa Xp + Xg Xa = 0, Žu Zp + KX = 0, Xa Xp + Xp Xam Sag. (11.12) 


(11.1) is the irreducible solution of these equations by a pair of 
Hermitian conjugate matrices X«, X, which are so normalized 
that X, x, is a diagonal matrix. 

In order to show that the equations (11.4) for the vector # 
in system-space yield the Hamiltonian equations (11.6) for the 
forms 


x. = Traln; n) Hn) pn’) and Fa 


we must prove that the formula 
oH 


x, H —Hx, = aE, 


employed in II, § 11, holds here as well. We find that it does 
not hold for an arbitrary polynomial H in x,, X,, but that it 
does for even polynomials in general and so in particular for 
the Hermitian form (11.8). For we have, for example, 


XX, Xg = 84, Xp — Ža X1 Xp = 9yqn Xp + Ža Xp Xa, 
whence 
X1, Xp — X,Xp° Xi = Õie Xp, X,H — Hx, = X He Xp. 
On introducing real quantities, i.e. Hermitian forms, Po, qa 
by 


1 . ~ 1 . 
Xa = FACE + ipa), Ža = 5 (Va — ipa) 


and denoting the set pi, q2; Px, q2; °° + Straight through by 
Pi; P2, Ps, Pa, *** We obtain the relations 


P3 = 1, PaPe + Pe Pa = 0 (x B) (11.13) 


The Pa are not only Hermitian but unitary as well, as can be 
seen from the first of these equations or directly. Here again we 
meet the matrices 


01 
10 


O—1 
1 0 


? 


which occurred in connection with the spinning electron. 
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We have thus discovered the correct way to quantize the 
field equations defining electron waves and matter waves. 
Here again we find, as in the case of the spinning electron, that 
quantum kinematics is not to be restricted by the assumption 
of Heisenberg’s specialized commutation rules. 


§ 12. Quantization of the Maxwell-Dirac Field 
Equations 21 


The field laws arise from a Hamiltonian principle which is 
analogous to the Hamiltonian principle of classical mechanics. 
This latter is expressed in terms of a Lagrangian function L 
which depends on the positional co-ordinates q; and their de- 
rivatives d; with respect to time, and asserts that the first 
variation of , 


[Lidi didi (12.1) 


vanishes when the q; are assigned arbitrary infinitesimal incre- 
ments 6g; which vanish outside a certain finite time interval. 


This principal yields, on integration by parts, the differential 
equations | 


oP +L,=0 with p,=— = Li = L (12.2) 
Defining 
H = L+ Sdp . 
and noting that ' 
òL = XLi Òq; — Zp: 54; 
we obtain for the differential of H the expression 
OH = £ Liq + Xdi òp: 


Expressing H as a function of the q, and the generalized momenta 
p; associated with them, we have 


dH Ww, 
ge Tt 
and by (12.2) these are just the Hamiltonian canonical equations 
dq; H dp: _ W 
dt dp; dit 04: 


In quantum theory the q, pı are operators satisfying Heisen- 
berg’s commutation rules. 
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This reasoning can be carried over without difficulty to the 
case of a continuum, as appears in field theories. On replacing 
for the moment the 3-dimensional space by the 1-dimensional 
interval 0 <x <1 described by the co-ordinate x and assuming, 
for the sake of simplicity, that only one state function q = q(x, t) 
is involved, the integral (12.1) is then to be replaced by 


f | Lla, d) dx dt. 


Naturally L may depend on the spatial derivative <, or even 

higher derivatives, in addition to g. The continuous variable 

x takes the place of the index 7 and the Lagrangian function, in 
1 


the sense of (12.1), is now the integral [Lla d)dx with respect to 


0 
the spatial variable instead of L itself. We first replace the 
continuum by a discrete set of equidistant points defined by 


Ax = = (i =0, 1l,- --, n— 1). The differential quotients with 


respect to x are naturally to be replaced by difference quotients 
with the difference Av = l'n, and the integrals become sums. 
In accordance with the outline above we must now set 


_ _ wy, 4) . 
Pi = då Ax, 


calculated at the point x = ijn. For the continuum we have 
analogously to set 
L(g, d) 


p = — ~, 


og 
and H is to be defined by 
1 
H = L + [åp dx. 
0 


The commutation rules which are satisfied by g, p in quantum 
mechanics cause some trouble. As long as we employ the 
discrete set of points in place of the continuum they are 


qla) Ble!) — ple!) q(x) = Y= bo 
where x, x’ run independently through the set ijn and 8.4 iS 
1 or O according as x’ coincides with x or not. For fixed x’ 


l 
Dg ew = Dfe — 2’) 
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is a function of x which vanishes for all values of the argument 
other than x’ and is there so large that the sum 2'8(% — x’) - Ax 
T 


has the value 1. In dealing with the continuum we therefore 
introduce with Dirac a function 6(% — x’) which vanishes at 
all points x + x’ and is so large at the point x’ that its integral 
has the value 1 (cf. I, § 7). Of course there exists no such 
function, but it can be “ arbitrarily closely approximated ” by 
a function which vanishes everywhere except in a very small 
interval about x’ and assumes very large values within this 
interval. Only in this sense can we perform the passage to 
the limit Ax = 0 and write the commutation rules symbolically 
in the form 


q(x) p(x’) — p(x’) g(x) = 18(x — x’). (12.3) 


A good illustration of the mathematical interpretation of 
this pathological function 6(x — x’) arises in the theory of ortho- 
gonal sets of functions ¢,(x), for with its aid the completeness 
condition may be formulated 


SOx (x) = 8(x — x’). 


This is literally correct as long as x only runs through a discrete 
set of points, but the rigorous mathematical formulation for 
the case of a continuum is given by 


1 
lim í PEIE + u(x) v(x’) dx dx! = fū(x) v(x) dx 
n> Q 00 i=] 0 
where u(x), v(x) are any two continuous functions in the interval 
(0, 1). Hence from the more rigorous standpoint (12.3) must 
be replaced by the equation 
11 


f [ulala ple’) — ple!) gle) Jole’) dæ da’ = afl 


containing two arbitrary functions u(x), v(x); furthermore, it 
is to be noted that the p, q in the brackets are first to be replaced 
by approximations p™, g™—e.g. by the n™ partial sum of 
their expansion in terms of orthogonal functions—and the 
passage to the limit n —> œ is to take place after, not before, 
the integration. This interpretation offers a sound mathematical 
method of dealing with the relation (12.3). Iti is to be emphasized 
that (12.3) refers to two points of space x, x’ at the same moment 
i, i.e. in a section of the world in whicht = const. ; the arguments 
of q and p are to be written more precisely as (x, t), (x’, t) re- 
spectively. 
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On applying this general scheme to the action 
W=M+M'+=F, (5.18) 


from which the field equations for the electron and for the electro- 
magnetic field are obtained, we find ourselves faced with a 
difficulty arising from the fact that the Lagrangian function 
does not contain the time derivative of the scalar potential f, 
for the generalized momentum associated with fọ then vanishes 
identically and cannot possibly satisfy a commutation relation 
such as (12.3). We avoid this difficulty for the moment by 
utilizing the principle of gauge invariance to remove fa from the 
expression of the Lagrangian function by setting it equal to 0; 
this device has already been employed in II, § 13. The set of 
independent functions describing the state is then 


p = (xy, Pa, ibs, ha), | = (fis Jas fa), 


where we have written ,, %, in place of wr, we. The momenta 
associated with these quantities are then found to be: i, with 
J, and — E, with fp. The commutation rules which are to be 
applied in quantizing the field equations are accordingly 


TolPbe(P!) + be(P)Gp(P) = Bye 8(P—P’) [p,0=1,2, 8,4], (12.4) 
folP)Eq(P!) — E((P'Vfo(P) = 850° 8(P—P’) [p,q=1,2,3], (124) 


where P and P’ are any two points of the same spatial section 
t = const. We have here taken account of the fact that the 
quantities describing matter are not to satisfy Heisenberg’s 
commutation rules, but are instead to satisfy those obtained 
by replacing the minus sign which occurs in them by a plus 
sign. These rules must be supplemented by the assertion that 
the %, satisfy in addition the equations 


o(P)pte(P’) + po(P’)p(P) = 0, (12.5) 


and the same for ù; that the f, at any two points P, P’ are 
commutative and the same for the £,; and finally that the 
material quantities y, y on the one hand and the electromagnetic 
quantities fp Ep on the other are kinematically independent, 
and that every quantity of the first kind at a point P commutes 
with every quantity of the second kind at any point P’ (in the 
same section t = const. of the world). 

As in II, § 13, we again consider the whole system enclosed 
in an insulated and perfectly reflecting cavity which is at rest. 
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In order to describe the electro-magnetic potentials we make 
use of a complete orthogonal set of solutions f of 


Af + vf = 0 (12.6) 
in the cavity, which satisfy the conditions 
div f = 0, f normal 


at the walls. The construction of such a system is readily 
obtained from the Gauss divergence theorem 


f (curl f: curl g + div f- div g +f: Ag)dV 
= (Ef, curl g]n + În div g) do (n denoting normal component) 


for the vector [f, curl g] + f div g, f and g being two arbitrary 
vector fields.?? We first determine the scalar functions ¢ = 4, 
which satisfy the equation Ad + à? = 0 and vanish on the 
walls, and from them construct the vector fields fa = grad 4); 
these vectors J, automatically satisfy the conditions above, 
are of course mutually orthogonal and can be normalized in 
accordance with the equation 


f- fed = By = 2] bdraV ]. 


We also determine a complete normal orthogonal system ], of 
solutions of (12.6) which are normal to the walls but which 
satisfy the condition div f, = 0 everywhere, not only at the 
walls. The fa are then orthogonal to these f, and they con- 
stitute together a complete orthogonal system for vector fields 
in the cavity. We may consequently write 


f = ae + a 


—€= pf. — Lab (2.7) 


in the section £ = const. The f}, fs are vectorial functions of 
position in space and have as values ordinary numbers, whereas 
the p, q are scalar quantum mechanical matrices which are 
independent of position and which satisfy the commutation 
rules | 


Py — Prg = i, Ula paqa =i; 


all g commute among themselves and all p among themselves, 
and any p commutes with any q whose index is not the same. 
[These rules are perhaps most readily obtained by solving 
(12.7) for the “ Fourier coefficients ” p, q in terms of integrals 
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of scalar products of f, © with f,, fp and applying the commuta- 
tion rules (12.4).] The energy 


l 2 l 2 
3) {#6 +z (curl f) lav 
of the electro-magnetic field becomes 
1 2, wo ero 
Zale + g) + 32% 


We already know the solution of the commutation rules which 
reduces this expression for the energy to diagonal form. The 
individual components of the vector on which the p, q operate 
are distinguished by means of the quantum numbers N,, corre- 
sponding to the v, and the values of the continuous variables q, 
corresponding to the À On setting q, = ~/«-Q,, Q, is an 
operator which affects only the index N, in accordance with 
the equations 


O(N N, — 1) = Ne, Oy N, + 1) = fet, 


all other components, corresponding to transitions N, —> N, 
in which N; is neither N, + 1, vanish. N, assumes the integral 
values 0, 1, 2, - - and can be considered as the number of 
photons of the kind v. The momentum p, associated with the 
continuous variable q, is, following Schrodinger, represented by 
the operator Ta The electro-magnetic energy is then in 
A 

diagonal form and, on neglecting the (infinite!) null-point 
energy, multiplies the vector component (N,; qa) with 


ZN, + 524i (12.8) 


We thus see how it happens that the electro-static part, which 
is described by the continuous variable g}, is separated off from 
the part due to the radiation, described by the discrete N, 
giving the number of photons of kind v. 

The % appear in the part of the energy due to matter only 
in combinations of the form $, Ye. Consequently it will be found 
advantageous in dealing with electrons to apply the method 
of composition followed by anti-symmetric reduction; we have 
shown in the preceding section that this procedure is equivalent 
to quantizing in accordance with the rules (12.4’). Since the 
electro-magnetic quantities commute with the y,, Y, they may 
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here be considered as ordinary numbers. The quantized wave 
equations then refer to a ‘‘ vector ” 3 with components 


Sor... on(Pa ee Pas Ny; qa), 


where Pi, © °°, Pn are the positions of the n electrons and 
Pı, ° * ', Pa are their spin variables, each of which runs through 
the four values 1, 2, 3, 4. We write z.. .p, a8 a column 
consisting of 4" terms; this z is anti-symmetric with respect 
to a permutation affecting the P, and p, alike. 6” = (SP, 
SP, SẸ) is the spin vector (S,, Sa Sa) operating only on the 
7 index p, I) is similarly the operation on the r index p, 
which interchanges Yı, a with ws, Ya, and grad”) is the gradient 
with respect to P,, The part of the Hermitian energy operator 
— Fo in the equation | 


ld 
Fie — Fo3 = 0 (to = ct, H = — cF,) 


which depends only on matter is 


n 


$ (©, + grad + VADO- FP.) + p> grad dal P,) + Š) 


r=1 7 dga 
+m STO) (12.9) 
yo] 


and to this must be added the electro-magnetic part (12.8). 
Since we have throughout taken the scalar potential fọ = 0 
we have lost the equation l 


div È + p = 0 (12.10) 


arising from the variation of fẹ This equation contains no 
derivatives with respect to time, and consequently represents 
a condition on the state of the field at a moment t = const. ; 
we must naturally take it into account. On substituting the 
value of © from (12.7) we obtain 


2102. Ad; +p=0 


and on multiplying with ¢, and integrating over the space under 
consideration 


qa — fopi dV = 0. 


From the standpoint of quantum mechanics the left-hand side 
of this equation is an operator D, and the meaning of the 
equation D, = 0 is that only those vectors 3 which satisfy the 
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equation D,3 = 0 are to be allowed. D, also consists of an 
electrical part q, and a material part 


fe g,dV = A ty + Pepe + Papa +. Pa Ya) dV. 


The operator D, which is to be applied to 3 is accordingly 
Dı = qa — 2bi(Pr). 


The equations D,g=0 then assert that all components 
a(P,; N,; qa) of § vanish except those for which q} = S'ha P,); 
r=1 


we may therefore write the non-vanishing components as 
n 
PPr; N,) = 2[P,; N,; ZpalP,)]. 


But then 


grad” 4 = grad”) z+ X grad ¢,(P,) - = 
A A 


is exactly the combination which appears in (12.9). 2g? is 
now given by A 


È ZHP) HP) = È GP, P) 


r,4=1 å T, 8 om 


where 


G(P, P’) = Z plp) pi(P’) 


is the ordinary Green’s function for the cavity. We conse- 
quently obtain the quantum equation 


A ee 


1 dX, 
for , in which the operator 
n ] n 
— Fo= EAS, grad) + mT! +E 3 GP, P.) 
r= rml 
+ IYN, + Vax DUS, P). O} (12.11) 
In Dirac’s theory 


1 
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is the energy operator for a single free particle. «G(P, P’) is 
the classical potential due to the electro-static repulsion be- 
tween two electrons situated at P and P’. The next term 
represents the sum of the energies v of the photons in the various 
frequency states v, and finally the last term represents the 
interaction between photons and electrons by emission and 
absorption. The meaning of each of the terms from which 
the energy operator (12.11) is constructed is thus apparent. 
The quantum theory had previously dealt with fields, such as 
that which binds the electron in hydrogen to the nucleus, in 
a manner entirely different from that with which it treated the 
field of the emitted radiation ; the first was calculated classically 
and purely electro-statically as an action at a distance described 
by the Coulomb potential, whereas the second was broken up 
into discrete photons with the aid of Bohr’s frequency condition. 
We have now obtained a theoretical justification for this pro- 
cedure which led to good agreement with experiment. 

Our expression shares with classical electro-dynamics the 
disadvantage that it contains the term G(P,, P,) representing 
the infinitely large reaction of the 7 electron with itself, for 
as we allow P’ to approach P, G(P, P’) becomes infinite like the 
reciprocal of the distance PP’. We should therefore replace 
G(P, P) by the finite I (P, P) where 

P S O LA mma 2 4 l 
(P, P') = G(P, P’) D. BP 


for this amounts to dropping an infinitely large additive con- 
stant from Fẹ T(P, P) represents the effect on an electron at 
P of the field obtained by reflecting the field of P in the walls 
of the cavity. (12,11) shows explicitly how the various terms 
of Fa depend on the value of the fine-structure constant «; on 
developing the solution in powers of a we are faced again and 
again with infinitely large terms of the same kind as G(P,, P,). 
The operator F, contains singularities which, at the present 
stage, frustrate all attempts to carry through the theory. We 
may indeed conclude with P. Fordan that the problem of the 
existence of the electron is solved, but that that of its con- 
stitulton has as yet eluded us. Our equations further suffer 
from the fundamental disadvantage of the Dirac theory that 
the individual spin variables p, assume 4 instead of 2 different 
values.” 

There is, of course, nothing to prevent us from quantizing 
the matter waves in a manner analogous to that applied to 
electro-magnetic waves. We should then develop our quantities 
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describing the material field in a series of characteristic 
functions ù = %) (with four components) of the Dirac equation 


He, caatmr+ub=0 a2 


which constitute, on imposing appropriate boundary conditions, 
a complete orthogonal system. The general component æ of 
the vector 3, on which the energy — c¥ operates, will then depend 
on the quantum number ną, which corresponds to the char- 
acteristic values u and which may assume only the values O and 
1, and in addition on the numbers N, of photons of the various 
frequencies v and on the continuous variables q}. But then the 
operators D}, which commute among themselves and with F., 
are not in diagonal form, and the elimination of q, cannot be 
accomplished as in the above method. 

Instead of introducing a cavity as in the above we may 
employ a rectangular parallelepipedon with the ‘‘ boundary 
condition ” that all functions are to be periodic functions whose 
periods are the lengths of the sides of the parallelepipedon. 
We can then introduce running instead of standing waves as 
characteristic functions for the electro-magnetic field; this gives 
rise to a better agreement with the physical picture in which 
a photon corresponds to a homogeneous plane wave. ‘The 
energy and the momenta are then also in diagonal form if we 
neglect the interaction between matter and light. Equation 
(12.10) then causes some difficulty, as its right-hand side 0 
must be replaced by the constant mean value of the charge 
throughout the entire space in order that a periodic solution 
be possible. On taking account of protons in the theory this 
will automatically correct itself, as the total charge will then 
be 0. 

The dynamical law allows only those quantum jumps of the 
particles in which one n, falls from 1 to 0 and another ną jumps 
at the same time from 0 to 1. Consequently the total number 
of particles Zing, and therefore the charge, remains fixed; hence 


that portion of the dynamical laws in which the total number 
is a given finite n is separated off from the remaining portion 
and intercombinations between the two do not arise. Dirac 
has proposed to interpret the presence or the absence of a proton 
in the state of positive energy u as the absence or the presence, 
respectively, of an electron in the corresponding negative energy 
state — u; our laws will then include protons as well as electrons.?! 
Remembering that the numbers n, = 0, 1 were at first intro- 
duced merely as an arbitrary index indicating the rows of a 
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matrix, there is nothing to prevent us from replacing the numbers 
n, tor negative —p by nm, =1—n_,, keeping nt = n, for 
positive p. The theorem of the conservation of charge is then 


Sn} — En; = const. (u> 0). 

But we thereby alter the content, as well as the notation, of 
the theory ; we are now interested in that part of the dynamical 
equations in which only a finite number of n, with positive p 
are different from 0 and only a finite number of n, with negative 
p are different from 1! The quantum jump of an electron 
between positive and negative energy levels, which was so un- 
desirable in the Dirac theory as formulated in the previous 
section, now appears as a process in which an electron and a 
proton are simultaneously destroyed and as the inverse process. 
The assumption of such an occurrence, for which our terrestrial 
experiments offer no justification, has long been entertained in 
atrophysics, as it seems otherwise extremely difficult to explain 
the source of the energy emitted by stars. 

However attractive this idea may seem at first, it is certainly 
impossible to hold without introducing other profound modi- 
fications to square our theory with the observed facts. Indeed, 
according to it the mass of a proton should be the same as the 
mass of an electron; furthermore, no matter how the action 
is chosen (so long as it is invariant under interchange of right 
and left), this hypothesis leads to the essential equivalence of 
positive and negative electricity under all circumstances—even 
on taking the interaction between matter and radiation rigor- 
ously into account. 

Having now quantized the field equations, we must return 
to the question of how the constituents M, M ' F of the action 
behave under the substitutions (6.12), (6.13), (6.14). The first 
two substitutions, which we may call (a) and (5), have exactly 
the same effect as before. But the third substitution (c), 
which sends the components of # over into the components 
of % or their negative, now affects M and M’ differently, for 
ys and # are no longer commutative with respect to multiplica- 
tion—they are, in fact, almost anti-commutative. From this 
it is found that M, M’, F behave under (c) in exactly the same 
way as they do under (b), i.e. they are multiplied by the signs 
—, —, + respectively. Hence past and future play essentially 
different réles in the quantized field equations ; we find no sub- 
stitution which leaves these equations unchanged while reversing 
the direction of time. It seems to me that we have thereby 
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reached an extraordinarily important goal of physics. We 
can now obtain the substitution 


ta — fa (a = 0, 1, 2, 3) 
py > = Pa pa —> bs, ps —> Pa, p> — 3 


on combining (a), (b) and (c); this substitution neither affects 
the co-ordinates nor disturbs the quantized wave equations. 
In view of Dirac’s theory of the proton this means that positive 
and negative electricity have essentially the same properties 
in the sense that the laws governing them are invariant under 
a certain substitution which interchanges the quantum numbers 
of the electrons with those of the protons. The dissimilarity 
of the two kinds of electricity thus seems to hide a secret of 
Nature which lies yet deeper than the dissimilarity of past and 
future. 


§ 13. The Energy and Momentum Laws of Quantum 
Physics. Relativistic Invariance 


In quantizing the wave equations the spatial and temporal 
variables were treated so differently that the relativistic in- 
variance of the resulting laws might seem to be open to serious 
doubt.. But a thorough investigation due to Heisenberg and 
Pauli reassures us on this point. We carry through these 
considerations on our action principle—but in such a way that 
the general validity of the argument may be readily seen. At 
the same time this offers an opportunity to discuss the meaning 
of the quantization more thoroughly than we have done hitherto. 


I. The Energy and Momentum Laws of Quantum Physics. 


We begin with the 4 + 3+ 3 operators p, fp, E, which 
are functions in 3-dimensional space satisfying the commutation 
rules (12.4) and the supplementary rules there set forth. There 
exists one, and in the sense of equivalence only one, irreducible 
solution of these conditions. From it we obtain the energy 


density tọ defined by (6.5), (6.6) and integrate it over all of 
space : 


Fo = (Rav. (13.1) 


We next construct the “ commutator ” 


8D = [Fy, D] = 315. DF) 
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of an arbitrary operator ® with Fẹ Consider the result of this 
for the particular operators D = W, fy, E,; it should be possible 
to evaluate these commutators using (12.4) and the supplement- 
ary rules alone; if one of the quantities involved appears as a 
derivative with respect to a spatial co-ordinate it should be 
transformed by integrating (13.1) by parts—or by deducing 
commutation rules for it from (12.4) in terms of appropriately 


defined derivates of the è function. If = is that process 


0 
involving only differentiations with respect to the spatial vari- 
ables, but which coincide with the derivative with respect to 
time in virtue of the Maxwell-Dirac field equations, we find 


(d Lap. sp — fe fo up. 
oy, = (sm “+ ifa Hr ) Ò fp = a = aE, | 
(13.2) 


We now drop the normalization fy = 0. It follows from these 
equations that ò® for any gauge invariant operator ® coincides 
with its time derivative as defined in terms of its spatial deriv- 
atives by means of the field laws. We may therefore replace 
the Maxwell-Dirac field equations by the quantum mechanical 
dynamical law i 


d3 
ax, 7 To (13.3) 


% represents the probability state of the physical system (pure 
state!) at the time x); it is a vector of that vector-space in which 
our operations take place. The fundamental concepts here 
involved are contained in the general programme of quantum 
mechanics as set forth in II, § 7. The “ density of electricity 
at the point P ” is, for example, represented by the operator 
p = dy, + + + which is independent of time. The changes 
in the probability distribution for this physical quantity in 
course of time are due to the changes in the state 3 and not to 
changes in p itself; the rule for the calculation of this probability 
distribution from p and 4 is given in the general programme 
referred to above. The same remarks apply to any gauge 
invariant quantity ®. However, it is more desirable to con- 
sider the “density of electricity ” (without specifying either 
time or position) as a fixed physical quantity represented by a 
definite operator p, and to ascribe the variations in its prob- 
ability distribution in time and space to changes in the prob- 
ability state 4 considered as a function of the spatial co-ordinates 


266 APPLICATIONS OF GROUP THEORY 


%1, ča Xa in addition to the time x». We should then expect to 
find four equations 


T= Sah (a = 0, 1, 2, 3) (13.4) 
in place of the one (13.3) in which the operators 
Fa = [EAV 


are those representing energy and momentum. Only now that 
we have formulated the general scheme of quantum physics 
in a manner which is symmetric with respect to the spatial 
and temporal co-ordinates, as required by the theory of relativity, 
can we consider it as complete. In order to determine the 
mean value of a quantity such as the electric density p we must 
assign to the spatial co-ordinates %,, ¥ə, %3, on which the operator 
p depends, any definite values x} (e.g. 0). The spatial com- 
ponents of equation (13.4) tell us that the replacement of (x}) 
by a neighbouring point (x? + dx,) amounts to the same thing 
as subjecting the normal co-ordinate system in system space, 
to which the vectors 3 are referred, to the infinitesimal rotation 


UF, dx, + Fa dxa + Fz dg). 


+ We must not forget that the equation (13.3) is not equivalent 
to the complete set of field equations, for we have omitted the 
one 


o(P) = divE+p=0 


which does not involve differentiation with respect to time. We 
must therefore restrict ourselves to vectors 3 which satisfy all 
the equations 


o(P)z = 0. (13.5) 


These equations define a linear sub-space R, of the original 
system-space R. The operators o(P), o(P’) associated with any 
two points P, P’ of space are commutative : 


a(P) o(P’) — o(P’) o( P) = 0. 
It is of prime importance that o(P) commute with F, i.e. that 


da = (Foo = oF 9) = 0; 


that this is the case follows from the fact that the equation 

oo ba ; 

TA 0 is a consequence of the remaining field equations in 
0 


the classical field theory, and consequently—independently of 
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our field equations—we may conclude that the gauge invariant 
operator o satishes the equation ôs = 0. This commutativity 
of o(P) and Fo guarantees that the infinitesimal rotation 27 ydx, 
of system-space during the time interval dx, does not carry the 
vector 3 lying in the sub-space R, out of Re. 

Continuing our programme, we now set 


= fg av 
and investigate the ‘‘ commutator ” 
p = Rar D] 


of an operator ® with ¥,; we shall denote this commutator by 
5, whenever confusion might arise between it and the commutator 
6 = 6, with Fo We find the equations * 


By = (so + + th); 8h = 0, Sfp = H, = 22 2h... 


0X, Da 
dE dE dE 
— —! 2 o£ “i — = aR + kd * 
ôE, = QE t Xs TP )= wry a, OE OX,’ 


(13.6) 
From this it follows that for any gauge invariant quantity ® 
dP . . . 
we have 60 = x, Ot taking the equation o = 0 into account. 
1 
Hence the way in which gauge invariant quantities depend on 
the spatial co-ordinates can in fact be described as we predicted : 
the operators representing them are constant, but the vector 
4 representing the probability state varies in space in accordance 
with the equations (13.4) for « = 1, 2, 3. 
That the four equations (13.4) are consistent also follows 
from these considerations. In the first place we have 


50=0 or of P\F,— F,o(P) = 0 


in the entire space R; this follows from (13.6). In the classical 
field theory the differential conservation theorem 


ato (28 ot? af) o 
dLa 0%, | dxa dry 
is a consequence of the field equations. Since “7 is a gauge 


invariant, it follows that after the quantization the operators 
satisfy the relation 


vw. le 
TENC EEEREN 
of t t eT n 


* In contrast with (6.2) we now employ the letter 5, without the factor 
1/x, as an abbreviation for curl f. 
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in the space R, defined by (13.5). Integrating over the space 
Xo = const. we obtain 


Sof LAV = 0 or FoF, — Fr. Fo = 0. (13.7) 


[The equation which takes the place of (13.7) for the entire 
space is 


Fo Fi — Fai Fo = [oE dV.] 


Furthermore, 


in Re, and on integrating this over space we find 
Sef BAaV = 0 or Fefe — Fe = 0. 


We thus see that the operators ¥, are commutative in R,, and 
consequently equations (13.4) possess one and only one solution 
when the initial value of 3 (i.e. at the origin of the space-time 
co-ordinate system) is a given vector in Re. 


Il. Relativistic Invariance. 


On transforming from the normal co-ordinate system x, in 
Space-time to another x, by means of a Lorentz transformation 


3 
A: x = So.ex 
a Pn aß ~B 


the solution of the equations 


= 1 d 
10x, ô ( ) 


is, as we shall show, obtained from the solution of (13.4) by 
means of a unitary transformation U induced in system-space 


by A. That is, there exists a unitary transformation U such 
that 


l ' 
=A(U8) = (EFada) (U3) 
is satisfied in virtue of (13.4) : 
Us SF, dx, = LF, dxs U 
æ a B 


or 


UF. = Zopa Jg U. (13.8) 


LAWS OF QUANTUM PHYSICS 269 


We could also say that (13.4’) have the same solution 4 as (13.4) 
but that the normal co-ordinate system employed in system- 
space has undergone the unitary rotation U, for the vector U3 
has the same components with respect to the new co-ordinate 
system as 4 had with respect to the old. We are only able to 
give the transformation U explicitly for infinitesimal A : 


longl = 1 + Woupl; U= 1+ }8M. 
The equations (13.8) which are to be verified are then 
ZJ S0ga = [8M, Fa]. 


In particular, the operators in system-space which correspond 
to infinitesimal rotations in -physical space are, as we have 
long known, those representing moment of momentum: that 
6M corresponding to the infinitesimal rotation D, : 


ÒXo = 0, 6X4 == 0, ÒX m X3, ÒX == Xo (13.9) 
about the x-axis is the x,-component of moment of momentum: 


(My =) Mog = | (2 — xad. (13.10) 
The infinitesimal Lorentz transformations which actually repre- 
sent a re-partitioning of the world into a new space and a new 
time are dealt with in exactly the same manner; it will suffice 
to consider as typical of such transformations 
bX —r vy, Òx =— Xo; OX» = 0, bX = 0. 


The òM associated with this transformation is 


M io = fz tf dV + \x, t9 aV; 


the second term, which vanishes for x, = 0, can be omitted, 
for we have already shown that F, commutes with all ¥,. This 
term does not fit into the present scheme, in which all the 
operators are functions of xı, %a % alone. Our problem is thus 
reduced to showing that in R, 


[Mas F] = 0, 0, Fs, — A) _ (13.11) 
[Mio Fax] = Fr, T Fo, 0, 0 for * 9, l, 2, 3. (13.12) 


Furthermore, the invariance of equations (13.5) which define 
the sub-space R,, will be proved by showing that the equations 


[Mas 0] = 0, [Myo o] = 0 (13.13) 
hold in the entire space R. 
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In order to prove (13.11) we make use of the identities 
0 » 40 
[penny =0 [x=], 2,3]. 
Xx 
Introducing the Kronecker 6,,, the integrand may be written 
0 0 ds (lg 
(Sse i3 — Šas t2) + (mse Xs 52). 


In consequence of o = 0 and since £ = 8, £ are gauge invariants 
the operations 


č may be replaced by 8, = [%, d], 
whence 
(Sna Fa — Sna Fa) + Sa | (xa 8 — xs 8)dV = 0 
or 


òx M o3 = [Fees Mss] = Ou3 Fo — Da2 F3 [a = 1, 2, 3]. 
In the classical feld theory the conservation law 
yB — Xa ta) — 
a= 0 0X, 
is a consequence of the field equations, whence on quantizing 


3 — 
Slt b — Xt) + LD (rati — ra t) = 
cms] xX 


holds identically in Re. Integrating over the whole of physical 
space we obtain 


0 


æ 


SoM as = [Fo Mo] = 0; 
equations (13.11), i.e. 
[M z, Fa] = a2 Fa — Sag Fo [æ = 0, 1, 2, 3], 


are thus completely verified. 
The relations (13.12) are obtained in an analogous manner 


from 
d(x, t 
(aay =0 [for a= 1, 2, 3] 


and from the equation 


j {S(x t) + AV = 0 
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which parallels the conservation theorem 


d(x £3 + X% to) 
| (xy 0 ss Xo Day — 0 
Xp 


of the classical field theory. 

We should expect the operator functions expressed by the 
wo, fo, Ep depending on the spatial co-ordinates, to be in- 
variant if we associate with an infinitesimal rotation of the 
spatial co-ordinate system an appropriate linear transformation 
of the components %, among themselves and of the vector 
components fp Ep and at the same time subject the normal 
co-ordinate system in system space Re to the corresponding 
unitary transformation. In formule: We expect the process 


òP = [Mz P] 
to yield the equations 
sr yr lo 
ofp = Jo + (Öp fa — Sos f2), 
òE, = VE, + (S50 Es — 593 E2), 


where we have written 
, ID WD 
Ò Ø = Xo va — Xs Ky 


But we find by direct calculation that 


, l 
òp = Ob + 1(%yf, — Xa fo) — aaa 


Of, = x Ha + x3 Ha, off = — x, Ay, Òf = — x Ay, 
DE p = DE, + òp! Es -f= Xa o) — 8 .n3(L -+ Xa o). 


We first observe that these equations yield 
So = [Mz o] = 0 


independently of the condition o = 0. On introducing the 
condition ø = 0 we find from these equations that gauge in- 
variant quantities ® exhibit the expected behaviour. The 
_ second of the equations (13.13) can be obtained by an analogous 
computation. 
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D. Quantum KINEMATICS 


§ 14. Quantum Kinematics as an Abelian Group of 
Rotations 


If we consider the operators ip, ig as infinitesimal unitary 
rotations of the ray field in system space, then Heisenberg’s 
commutation rules [II, (11.4)] assert that these rotations are 
commutative; consequently they generate a 2/-parameter 
Abelian group, where f is the number of degrees of freedom. 
Let us therefore investigate the properties of Abelian groups 
of unitary rotations in the ray field of n-dimensional space ! 
On introducing a gauge as in III, § 16, to each such “ rotation ”’ 
there corresponds a transformation of vector space with matrix 


A and between any two matrices A, B there exists an equation 
of the form 


AB = eBA. (14.1) 


This equation is possible only if € is an n™ root of unity, for on 
evaluating the determinant of both sides we obtain e” = 1. 
From (14.1) we obtain by mathematical induction 


AB = skB Ak 
AB! — eB A, } (14.2) 
for k, L= 1, 2, 3, ++. On combining these two equations by 


applying the second to A* and B instead of A and B we find 
the general rule 


AtB! = etl B! Ak, (14.3) 


Taking k = n in (14.2) we are led to the equation A*B = BA"; 
if the Abelian rotation group is irreducible Schur’s fundamental 
lemma allows us to conclude that since Æ” commutes with all 
elements B of the group it must be a multiple of the unit matrix : 
AXi. The order of any element of an irreducible Abelian 
rotation group in n dimensions is consequently a factor of n. 

An f-parameter continuous rotation group is generated by 


an f-dimensional linear family g of infinitesimal unitary corre- 
spondences 


oy Ca Hoal tH: e oC; (14.4) 
in terms of a basis formed by any f independent elements 


Cy, Cx, * + +, Cy of the family. The numerical parameters 
Gi, %, * * *, Oy may assume all real values. Setting o; = q, dr 


npldd i A 


a, Ps $ 
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and reiterating the infinitesimal transformation (14.4), we find 
that at “time ° 7 the resulting transformation is 


U(oy, Oo, ° * s op) = err + aCe tert ofCy (14.5) 


where we have replaced az by o; U runs through the entire 
group, which is now expressed in terms of the parameters o. 
If the group of unitary transformations of the vector space is 
Abelian the C, must satisfy the conditions 


CC, — CLC, = 0. (14.6) 
From this it then follows that all the elements (14.5) of the 


group are mutually commutative, for if AB — BA = 0 we have, 
as in the domain of ordinary numbers, 


e4. eB == ete, 
The parameters ø in (14.5) are added on composition : 
U(o,,° °°; as)U(oy, e, o) = Ulna +a, ts, os + ay). 


If, however, only the rotations of the ray space are commutat- 
ive, we find in place of (14.6) conditions of the form 


CC, — CC, = ttyl, 


where the cą constitute an anti-symmetric system of real numbers. 
The commutator of the infinitesimal transformations with 
matrices 


Asalyt poly, B= aitt ETC 


AB — BA =id¢,o,7,°1. 


HMs v 


We shall refer to the anti-symmetric form 
lw Ty = hlo, 7) 
My 


as the commutator form; it is invariant under change of basis. 

On writing 1 + a 1 + 2 in (14.3) in place of A, B and allowing 

= l = m—> œ, we find that the commutator of any two 
elements U(o, Ca °°‘, oy) = U(e) and U(r) of the group is 

U(e) U(r) U-*¥(e) U-¥(r) = e[h(o, 7)] + 1. (14.7) 

If the rotation group is irreducible a fixed U(o) can only 

commute with all U(r) if it is a multiple of the unit matrix, 


i.e. if all its parameters o vanish. From this we conclude that 
the commutator form is non-degenerate, i.e. that it cannot 
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vanish identically in 7, for a fixed set of values o,, unless all 
o, = 0—thıs amounts to the same as the condition |c,,j + 0. 
Such a form exists only if the number f of variables is even, in 
which case it can, by appropriate choice of the basis (i.e. by 
transforming the variables øo; and 7, cogrediently under an 
appropriate transformation), be reduced to the canonical form 
in which the matrix ||c,,|| is decomposed into 2-rowed sub-matrices 


0 1 
—l 0 


arranged along the principal diagonal.* It is then desirable to 
write 2f in place of f and to denote the “ canonical basis” so 


obtained by 

iP, 10, (v=1,2,--+ +, f) 
and the corresponding parameters by o,, q, The factor i has 
been introduced in order to express the results in terms of 


Hermitian operators P,, Q,. The basic elements then satisfy 
the commutation rules 


1(P,Q,—Q,P,) =1, (P,Q, — QP) = 0 
for u + vy and 
P,P, — P,P, = 0, 0,0, — 2,0, = 0 
for all p, v. The elements 


U(c) = e(o,P, + oP, tee + oP) [e(x) = e”) 


then constitute an f-parameter Abelian group of unitary (vector) 
correspondences, as do also the 


V(r) = e(t:1Q, + Ta ee st 7 7Qy). 


But the commutator of elements U(c), V(r) belonging to these 
two sets, respectively, is 


Ulo) V (t) U-10) V-17) = elor, +- e oyty) * 1. 

We have now carried our development to a point where we 
can profitably return to the considerations of II, § 11. In 
the case of a system with one degree of freedom in classical 
mechanics any physical quantity associated with the system 
is expressed mathematically as a function f(p, q) of the canonical 
variables p, q. In making the transition to quantum mechanics 


we had previously restricted ourselves to polynomials in p, q. 
But the Fourier representation 


fip, q) = | felap + 74) Elo, 7) do dr (14.8) 


* See Appendix 3. 
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of a function f is applicable to a much larger class of functions 
this integral need not be interpreted literally, the essential 
point being that ıt represents a linear combination of the simple 
functions e(op + 7g). On considering ip, ig as infinitesimal 
unitary correspondences in ray space which are commutative 
in accordance with the relation 


pq — gp) = 1, (14.9) 
e(op + rq) runs through the group generated by them. If we 
now consider (ø, T) as the components of an element in the 
resulting group algebra, then (14.8) is its group matrix in the 
representation obtained by associating with (ø, r) the unitary 
transformation e(op + rq). This group matrix is Hermitian if 
the element is real, i.e. if 

Elo, r) = €(— o, — 7). 
A quantity f is consequently carried over from classical to 
quantum mechanics in accordance with the rule: replace p and 
q in the Fourier development (14.8) of f by the Hermitian operators 
representing them in quantum mechanics. In particular, the 
derivatives of f are represented by 

(oe) 


e- 
Jg“ if felop + rq) + a&(o, T) do dr, 


+o 
Jfa = if felop + rq) rél, 7) do dr. 


= OO 
On letting U(r) in (14.7) again in infinitesimal we find, with 
the aid of the commutation rules (14.9), that 
p-elop + T4) — elop + 7q) ` p = 7: elop + 79), 
q- elop + 7q) — elap + 79) `q = — o: elop + Ta). 
We therefore have in general 
Pa S a a 
as required in order that the Hamiltonian equations 
dg ap 
ae fe gp ee 
be equivalent to the quantum-theoretical equations of motion 
for the vectors of system space. 

We have thus found a very natural interpretation of quantum 
kinematics as described by the commutation rules. The kine- 
matical structure of a physical system 1s expressed by an irreducible 
Abelian group of unitary ray rotations in system space. The real 
elements of the algebra of this group are the physical quantities of 
the system; the representation of the abstract group by rotations 
of system space associates with each such quantity a definite 
Hermitian form which “ represents’ it. If the group is con- 
tinuous this procedure automatically leads to MHetsenberg’s 


276 APPLICATIONS OF GROUP THEORY 


formulation; in particular, we have seen how the pairs of 
canonical variables then result from the requirement of irre- 
ducibility, whence the number of parameters in such an irre- 
ducible Abelian group must be even.*® 

If one of the canonical co-ordinates, say g, is a cyclical 
co-ordinate with period 27, then all quantities of the physical 
system are represented by periodic functions with period 2n. 
Consequently the only values assumed by the parameter r 
associated with q in (14.8) are multiples of 2 and the integral 
is to be replaced by a sum. In such a case we are no longer 
dealing with a continuous group, but with a mixed (continuous- 
discrete) group. 

Our general principle allows for the possibility that the 
Abelian rotation group is entirely discontinuous, or that it 
may even be a finite group. Thus we have discussed in III, 
§ 16, a group of order 4 and an irreducible ray representation 
% of it in 2 dimensions. That such groups actually occur in 
Nature is shown by the fact that the group we have just men- 
tioned characterizes the kinematics of the electron spin dis- 
cussed in § 4. It can be readily shown that &% is the only 
irreducible representation of this group, and that it is in fact 
the only irreducible 2-dimensional group of unitary rotations in 
ray space. These results emphasize the remarkable nature of 
this simplest case. The quantization of the problem of several 
electrons discussed in § 11 also falls within our general scheme. 
In dealing with it we are interested in that Abelian group whose 
basic elements py («= 1, 2,- + -, 2f) are all of order 2; such 
a group consists of the totality of the 4 different elements 


PPT: + PY (n, = 1 or 0). 


The gauge can be so chosen that the corresponding unitary 
matrices p, in the irreducible ray representation in 2/ dimensions 
satisfy the equations 


Pa = 1, PaPa = — Papp (a = Ê). (14.10) 


The kinematics of the spinning electron is described by the 
simplest case f = 1 of this representation. 

Because of these results I feel certain that the general scheme 
of quantum kinematics formulated above is correct. But the 
field of discrete groups offers many possibilities which we have 
not as yet been able to realize in Nature; perhaps these holes 
will be filled by applications to nuclear physics. However, it 
seems more probable that the scheme of quantum kinematics 
will share the fate of the general scheme of quantum mechanics : 
to be submerged in the concrete physical laws of the only existing 
physical structure, the actual world, 


—— a M a a E E E A) 
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§ 15. Derivation of the Wave Equation from the 
Commutation Rules 


We now show by actual construction that there exists but 
one irreducible ray representation (excluding the identity) of 
a 2-parameter continuous Abelian group: namely, that one 
which leads to the wave equation. 

We obtain our 2-parameter continuous group as the limiting 
case of a finite group with 2 basic elements; our proof is rigorous 
only insofar as the validity of this limiting process is admitted. 
Let A, B be two commutative rotations of an n-dimensional 
Unitary space. On introducing the gauge we have an equation 
between their matrices: 


AB = BA, (14.1) 


in which, as we know already, e is an m™ root of unity. The 
system consisting of the two matrices A, B shall be irreducible. 
Let their commutator, the number e, be a primitive m™ root of 
unity, i.e. e” is the lowest power of e which is equal to 1; m is 
then a factor of n. The orders of the rotations A, B are also 
factors of n: A™—~1, B” œ= 1, so the gauge may be chosen in 
such a way that A" = 1, B” = 1. Let B be reduced to diagonal 
form by an appropriate choice of our normal co-ordinate system ; 
the elements b; in the main diagonal are then all nth roots of 
unity. Equation (14.1) then yields the following conditions on 
the elements of A = |la,,||: 


sain = Elir (15.1) 

We divide the indices ż and the corresponding variables x; 
into classes in accordance with the rule that 7 and k belong to 
the same class if the quotient b,/b, is an m'" root of unity, i.e. 
a power of e. That this process really results in such a division 
into classes is shown by the fact that if D,/b, and b,/b, are powers 
of e, then b,/b, is also. By (15.1) ay, = 0 if i and k belong to 
different classes; hence the matrix A is reduced in accordance 
with the division of the indices into classes. But in view of 
the assumption that the system A, B was irreducible there can 
therefore exist but one such class. 

Having established this result, we now proceed to a finer 
division into classes : 7 and k shall now be considered as belonging 
to the same class if b; = bẹ} We arbitrarily choose as the first 
of these classes that one for which b, = b and let the second 
consist of those for which b; = €b, the third with b; = 6b, -: -, 
the mth with b; = e""1b; this exhausts the set, for the (m + 1) 
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class b; = e"b coincides with the first. Let the variables be 
arranged and numbered in this order. It then follows from equa- 
tion (15.1) that all sub-matrices (7, k) of the matrix A are empty, 
i.e. a;, == 0, unless their row index 7 and their column index 
k belong to successive classes. The matrix A then has the 
form indicated in Fig. 3, in which all elements in the non- 
shaded portions are zero (and we have taken m= 4). The 
shaded portions are occupied by the sub-matrices A"), Al), 
- ++, Alm), Since A is unitary the sum of the squares of the 
absolute values of the elements of a row or column is 1; the 


same must therefore also hold for the rows and columns of 
each of the sub-matrices. The sum of the absolute values of 
the squares of all elements in A@) must then be equal, on the 
one hand, to the number of rows and, on the other, to the number 
of columns; the rectangle A is consequently a Square, and 
the number of indices in the second class is equal to the number 
in the first class, say d. By the same argument we sce that 
the number of individuals in each of the m classes is d, and hence 
n=md. The figure is to be corrected accordingly; each of 
the shaded matrices is now unitary. On subjecting the variables 
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of the first class to the unitary transformation with matrix 
A®)* the sub-matrix AG is reduced to the d-dimensional unit 
matrix. This normal form is undisturbed by a unitary trans- 
formation affecting the variables of the first set and the variables 
of the second set alike; we can therefore reduce the second 
sub-matrix to a multiple of the d-dimensional unit matrix, and 
so on through the (m — 1). The normal form so obtained is 
unchanged on subjecting the variables of each class to the same 
d-dimensional unitary transformation ; we may therefore choose 
as this last transformation one which reduces A™ to diagonal 
form. But the matrix A is then decomposed into d-sub-matrices, 
as can be seen by renumbering the variables, taking first the 
first members in each set, then the second, etc. The irreduci- 
bility assumption then tells us that there can be but one member 


in each set: d = l, n = m. Our matrices are now in the normal 
form: 


l e” 
0 } ertl 
A =— 0 1 B — ert2 
adgood---:-0 fo gate 
all elements not explicitly indicated are zero. The exponents 
in B are n successive integers and e is a primitive n™ root of 
unity. Finally, the equation A” = 1 yields a = 1. We number 


the variables from 7 on and take indices which are congruent 
mod. n as equal; the two correspondences are then 


+ 
A: y= lpup B: Xy = FXp 
On reiteration we find 
t 
AS: My == Xess, BY; Xy == ekty.. (15.2) 


‘The transition to continuous groups is now accomplished by 
passing to the limit n — oo. Let the basis tP, 1Q of the con- 
tinuous 2-parameter Abelian rotation group be normalized in 
accordance with (14.9). We identify the matrix A of the above 
considerations with the infinitesimal e(€P) and B with e(nQ) 
where € and 7 are real infinitesimal constants. Then e(oP) = 
A*, e(rQ) = B' when in the limit sé -> o, yr. € is now 
eln) and ef == e(ékr). e(7Q) represents the physical quantity 
e77- the values which it may assume are given by e’**, where 
7 is real and k runs through all integral values. In other words, 
the quantity q may assume the values ké: q may assume all 
veal numbers from — œ to + œ. (Of course k is to be con- 
sidered mod.n and kë mod.né, but né is a multiple of 27/7 
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and may consequently be infinite in the limit.) We therefore 
write q in place of ké, where q is understood to be a variable 
which runs through the possible values of the physical quantity q, 
and V£- (q) in place of x, (gq) is an arbitrary function, 


whose values are complex numbers, which satisfies the normalizing 
condition 


f Yla) | dq = 1. 


On passing to the limit in the second equation of (15.2) we 
find that the quantity e'7 is represented by the linear operator 


Yla) —> e - pla). 


Similarly we find from the first equation of (15.2) that 


pla) -> (q + o) 


is the operator representing e'°P?, On returning from finite to 
infinitesimal unitary transformations we find 


1:84) =9-va), p= 83) 
We have thus finally justified the assumption from which we 
started in Chapter II. 

The extension of these results to systems with several degrees 
of freedom causes no trouble. The kinematics of a system which 
is expressed by a continuous Abelian group of rotations is conse- 
quently delermined uniquely by the number f of degrees of freedom. 
The postulate of irreducibility allows us to conclude that the 
particular operators (15.3) of the Schrödinger theory are a 
necessary consequence of Heisenberg’s commutation rules.?’ 

P. Fordan and E. Wigner ** have given a very elegant group- 
theoretic proof that there exists but one irreducible matrix 
solution of equations (14.10), i.e. that one of degree 2f there 
mentioned and given in greater detail at the end of §11. 


CHAPTER V 


THE SYMMETRIC PERMUTATION GROUP AND THE 
ALGEBRA OF SYMMETRIC TRANSFORMATIONS 


A. GENERAL THEORY 


§ 1. The Group Induced in Tensor Space and the 
Algebra of Symmetric Transformations 


HE principal problem we propose to solve in this chapter 
/ 1s the group-theoretic classification of line spectra of an atom 
consisting of an arbitrary number, say f, of electrons, 
taking into account the reduction of the space Rf to {RS}, as re- 
quired by the Pauli exclusion principle, and the spinning electron. 
For this it 1s necessary to consider in detail the representations 
of the symmetric group, i.e. the group 7; of all f! permutations of 
f things. These are most intimately related to the representa- 
tions of the group u of all unitary transformations or the group 
c of all homogeneous linear transformations of a space Ra. 
This connection has already been touched upon in Chapter III, 
§ 5: the substratum of a representation of ¢ or u consists of the 
linear manifold of all tensors of order f in R, which satisfy 
certain symmetry conditions, and the symmetry properties of 
a tensor are expressed by linear relations between it and the 
tensors obtained from it by the f! permutations. 

A tensor F of order fin the n-dimensional vector space R = R, 
is defined by its nf components or, as we prefer to say, “‘ co- 
efficients ” F(iyi,+++*1,); each of the indices 7 runs from 1 to n. 
Tensors can be added and multiplied by arbitrary numbers ; 
hence the totality of such tensors F constitute a linear “ vector 
space ” W of n! dimensions. Further, F can be subjected to 
an arbitrary permutation s of its f indices, which can be thought 
of as a permutation of the f numbers 1, 2,--:, f attached to 
the indices i in the general component above; if s is the per- 


mutation — 
L> 1’, 2> 2, +++, fof 
281 
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then the tensor sF obtained by applying s to F is, by definition, 
that tensor whose coefficients are 


SF (ty, se 8 Ty) = F(iyty .. » ty). (1.1) 


It follows from this definition that for any two permutations 
sand t 


t(sF) = (ts)F. 
A linear correspondence F —> F”: 
F'(i + + + dy) = Zali st tz; Rye + + ky) F(Ry + + + Ry) (1.2) 
is said to be symmetric if the coefficient 
Qty + e ty; Rye + Ry) 


is unaltered on subjecting the sub-indices 1, 2, --:, f of both the 
indices 7 and k to the same arbitrary permutation s. The pro- 
cesses of addition, multiplication by a number and permutation, 
in the sense defined above, applied to tensors are invariant 
under symmetric linear transformations; and conversely, any 
transformation of tensor space under which these processes 
are invariant is linear and symmetric. The totality of symmetric 
correspondences constitutes an algebra 2: if A and B are ele- 
ments of 2 then 4 + B, AB and cA (c an arbitrary number) 
are also. The problem with which we shall concern ourselves 
is the reduction of 9 into linear sub-spaces P which are in- 
variant with respect to X, i.e. with respect to all symmetric linear 
transformations. Wherever in the following we employ the 
terms invariant, irreducible, etc., in referring to the tensor space 
Rİ, they are to be interpreted with respect to the algebra J. 

We give a brief résumé of our terminology. We are dealing 
with a vector space R and a system 2 of linear correspondences 


E> p = AL 
of R on itself; we may often prefer to use the term “ linear 
projection ° instead of “linear correspondence (operator) ° in 
order to bring out the fact that the correspondence need not 
be one-to-one. A (linear) sub-space P of R is invariant if an 
arbitrary projection A of the system 2 sends every vector 
t of P over into a vector of P; P is irreducible if it contains 
no invariant sub-space other than itself and the space 0 con- 
sisting only of the vector 0. We shall always understand by 
a complete reduction $ = P, + P, of the invariant sub-space 
% a complete reduction into two linearly independent invariant 
sub-spaces $, Pa, even when this is not explicitly stated. A 
lincar projection g ->g of the invariant sub-space P on the 


THE GROUP INDUCED IN TENSOR SPACE 283 


invariant sub-space $ is similar if two vectors y and 4 of $ 
which are related by a correspondence A of the system : h= At, 
are always projected into two vectors r’ and fy’ of QB’ which are 
related by the same A: yY = Ar’, $ and P are similar or 
equivalent: P ~P if a one-to-one linear and similar corre- 
spondence can be set up between P and P. In particular, 
these concepts are to be applied to the case in which the vector 
space is the tensor space Rf = Rİ of nf dimensions and Æ is 
the totality of symmetric transformations. 

In quantum theory the state of a system consisting of f 
equivalent individuals (electrons) with a system-space is 
described by a tensor of order fin R. The energy necessarily 
depends on each of the f individuals in exactly the same way ; 
hence the Hermitian operator which represents the energy is 
necessarily symmetric in our sense. The fundamental dynamical 
law therefore allows us to conclude that an invariant sub-space 
P of R has the property that if the tensor describing the state 
of the system 1s at any time in $$ no influence whatever can drive 
it out. A complete reduction of Rf into invariant sub-spaces 
$8 implies a corresponding reduction of the operator representing 
the energy; hence the term spectrum is reduced into classes 
of terms belonging to the various $, such that the members of 
one class can under no conditions combine with the members 
of another. Naturally this division into non-combining classes 
is to be carried as far as possible. But this problem is exactly 
the one proposed above—the only difference being that we are 
here only concerned with the totality 2” of symmetric Hermitian 
operators. However, this restriction is quite irrelevant, for 
any symmetric operator can be written in the form A = A, + iA, 
where 


lanal 


l l 
A, = 5\4 +4), A,= 5\4—A), 


are both Hermitian. 
On going over to a new co-ordinate system in the fundamental 
vector space R by means of a non-singular transformation 


x; = X altk)x, (1.3) 
k =] 
the coefficients of a tensor F are transformed in accordance with 
F (iit 7. fe 1) ns Z aah) (takes) . . a(tyky) i FRR <. 8 ky) 
(1.4) 


The transformation (1.3) in vector space induces the symmetric 
transformation (1,4) in tensor space. These induced trans- 
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formations, which we shall call ‘ special symmetric transforma- 
tions,” constitute a group 2» which is isomorphic with the com- 
plete linear group ¢ = ¢,; this representation of ¢ was previously 
denoted by (c)f. The group X, is contained in the algebra 2. 
Hence a sub-space $ of RI which is invariant under the algebra 
= is a fortiori invariant under the group 2). That the converse 
of this result is also valid is not so self-evident. Nevertheless 
for all questions involving only linearity 2, can be replaced by 
the more extended 2, for £ is what we might call an enveloping 
algebra for the group Za; by this we mean that any symmetric 
transformations can be expressed as a linear combination of 


appropriately chosen special symmetric transformations.: To 
show this we prove the theorem : 


vi homogeneous linear relation 
PR reedy; kyoto Ry) alge e iy; kytt k) =O (1.5) 
is satisfied identically by all symmetric transformations 
(ty + st iy; Ry + + keh 


if it is satisfied by all special symmetric transformations, i.e. if 
the equation 


iin ir kate adeliaey) +++ alik) = 0 (16 
is satisfied for all values of the n? variables x(ik) for which 
the determinant |x(ik)| + 0. l 

Proof, Denoting the pair (ik) of indices by j and calling the 
n®:.m values of j simply 1, 2, -- -, m, the left-hand side of 
(1.6) is a homogeneous polynomial of order f in the m variables 
x(ik) ye 


Dixa oar Men) nd Z bla fas . + fmit pii xt" 


, ! 
where A, F fate e toh Sme fand blfu Jat * ts Fm) sah ‘Fm! 


times that coefficient ¢Gijy + © * Jy) whose indices contain j=l 
j,times,] 2 fatimes ete. On denoting that variable x(j1j2°*" jy) 
in whieh the indices j =: 1, 2, +++, m occur fy, Jfa’ ', Jm times by 


yU fan’ fm) the left-hand side of equation (1.5) becomes 
bf Ss, CON SY Cfi Iont Sm). 


The determinant of the x(ik) is a certain polynomial D(%3%q °° * Xen) 
in the variables x, Our assertion is thus reduced to the well- 
knnwn theorem of algebra: let ¢(x), D(x) be two polynomials 
in the variables x; Xa" °° Zm the second of which does not vanish 
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algebraically, i.e. its coefficients do not all vanish. If d(x) is 
zero for all values of the variables for which the value of 
D(x) + 0, then ¢(x) vanishes algebraically. 

This theorem is proved for a single variable x as follows. 
If ¢(x) does not vanish algebraically it has a definite degree 
p20; let g be the degree of D(x). There are then at most 
p +q values of the variable x for which ¢(x) or D(x) vanish ; 
for any one of the remaining infinitude of possible values of 
x neither ¢(x) nor D(x) can vanish, contrary to assumption. 
The theorem is readily extended to polynomials in any number 
of variables by mathematical induction. The principal point 
is that the analytical vanishing of a polynomial for all values of 
the independent variables implies that it vanishes algebraically. 

In quantum theory the vector space is unitary : the transi- 
tion from one normal co-ordinate system to another such is 
accomplished by an arbitrary unitary transformation (1.3). 
The transformations thus induced constitute a sub-group 2 
of 2» which is isomorphic to the unitary group u,, i.e. the 
representation (u) of the unitary group. I assert that a sub- 
space P of Rİ which is invariant and irreducible with respect 
to & remains irreducible not only under the group 2, but under 
the more restricted group 2) as well. To prove this we must 
show that the identity (1.5) holds even when we assume only 
that (1.6) is true for those values of the variables x(ik) with 
unitary matrix. 

One of the most natural proofs of the above theorem con- 
cerning the formal vanishing of a form @¢ of order f depends on 
the process of “ polarization’: we assign arbitrary infinitesimal 
increments dx; to the values of the variables x;; the identical 
vanishing of ġ then allows us to conclude that the differential 


og 
— d 
ox; Xi 
vanishes for arbitrary values of x; and dx;. This procedure 
also leads us to the desired conclusion in the case under con- 
sideration. Denoting by ® the matrix obtained by transposing 


mal we have 


rows and columns in 


tr (dX) = 0 


where X, X+ dX are two arbitrary neighbouring unitary 
matrices. In order that this be the case we must have 


dX = 1X + bX 
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where 5X is an arbitrary Hermitian matrix: the ‘ rotation” 
X + dX is obtained by following up the rotation X with the 
infinitesimal rotation 1 -++7-6X. But the equation 


tr (BX + 8X) = 0 


implies the vanishing of ®X. This is seen immediately from 
the fact that a linear form 


Lain Vir 


in the variables y; = 6x(zk) vanishes identically if it vanishes 
for all values satisfying the condition Yz; = ix; indeed, any 
matrix Y = |ly,,|| can be written in the form Y, + iY, where Y, 
and Y, are Hermitian. On multiplying the right-hand side 


of @X =0 by X! we find ® = 0: all derivatives sam 
vanish in the same sense as ¢ itself, i.e. for arbitrary x(ik) whose 
matrix is unitary. But these derivatives are forms of order 
f—1; the truth of our assertion above is thus proved hy 
mathematical induction. 

Every invariant sub-space P of R is the representation 
space of representations of the groups ¢ and u which are con- 
tained in (c) and (u)J respectively. Hence the above results 
prove that if $ is irreducible these representations are also. 


§ 2. Symmetry Classes of Tensors 


One of the most natural methods of obtaining invariant 
manifolds of tensors F consists in subjecting F to linear symmetry 
conditions of the form 

a a(s): sF = 0, (2.1) 


This suggests introducing the symmetry operator 
a= Jia(s):s. (2.2) 


Such operators can be added and multiplied with arbitrary 
numbers, and two operators a, b can be applied successively 
with the same result as the symmetry operator c = ba defined by 


e(s) = X (i) a(t). (2.3) 


In other words, we are here led in a most natural way to the 
algebra p of the symmetric group 7 = zy of all permutations s. 
The elements of this algebra, which constitute an f!-dimensional 
linear space t, appear as operators which can be applied to 
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tensors of order f. We may call the numbers a(s) appearing 
in (2.2) the components of the element a. In particular, @ is 
an Hermitian operator in the tensor space Rf if it is a real 
element, i.e. if it coincides with its Hermitian conjugate @ 
defined by the equation 


a(s) = a(s~*), (2.4) 


Hence these real symmetry operators represent physical quan- 
tities of the physical system consisting of f equivalent individuals, 
whose total system space is R; quantities of this kind are 
unknown in classical physics and cannot be pictured in terms 
of the usual spatial and temporal models.? 
(2.1) or 
>» a(s)x(s) = 0 
8s 


is a linear condition which is imposed on the element x = F 
defined by x(s) = sF. A symmetry class is defined by one 
or more equations af this kind ; we are thus led to the definition : 

Each linear sub-space p of t determines a symmetry class Ẹ 
of tensors. F belongs to P when the corresponding symmetry 
quantity or element F is in p. It will be found convenient to 
denote the poges by which % is generated from p by a symbol ; 
we write $ = 

If the rea a finds it difficult to operate with elements F 
whose components sF are tensors rather than numbers he may 
replace the tensor by the totality of its coefficients F(z, 7, + + + 1,) 
and F by the elements 


X = F(iyty + + * 14) 


associated with each definite set of indices (tfa © ++ ty); this x 
is defined by the equation 


x(s) = SF(iqtg + + * ip). 


The requirement that F belong to p means that F(iig - © - iy) 
belongs to p for all the nf possible combinations of the indices i. 
That the symmetry class $ = ftp is invariant with respect to 
all symmetric transformations (1.2) is due to the fact that (1.2) 
implies the corresponding equation for the elements F, F’. 
F' (iyi, ++ ip) is a linear combination of the elements F(R, : - + Ry) 
associated with the various combinations (kks +--+ ky) of indices k. 

If F belongs to p then a- F does also, where a is any element 
whatever of the algebra. To show this we note that the 
s-component of 


288 THE SYMMETRIC PERMUTATION GROUP 
is given by 
Zal’) S rsF(i sees is) = Salr `’) # SF(k, a Rs) 


T 

where the k, - - +, ky are obtained from 1, * * *, ty by the per- 
mutationr, Hence H (i, * ++, iy) is a linear combination of those 
F(k, + - - ky) whose indices k are obtained by a permutation of 
the indices 2. : : 

The principal question now is whether every invariant 
sub-space P can be generated from a p by the process f and 
further, whether or to what extent this generating p is uniquely 
determined by $. The answer is perhaps best expressed with 
the aid of the inverse process ' which generates a p from tke 
given $. The following geometrical analogy may be useful 
in enabling the reader to understand the situation with which 
we are dealing. Let the points x of a plane with a fixed centre 
correspond to the elements of the algebra p and the line segments 
F going out from the origin correspond to the tensors. On 
contracting the entire plane, leaving the centre invariant, in 
the fixed ratio r (0 S71) the point x goes into the point 
TX and the segment F into the segment rF; this contraction 
of segments shall be the analogue of the symmetrical trans- 
formations of tensors. ‘$8 will now denote an “invariant ” 
set of segments, i.e. a set such that if it contains the segment F 
it also contains all the contracted segments rF. Just as we 
associated the symmetry elements F(z, +++ iy) with the tensor F 
we now associate with the segment F the continuum of points 
F(r) of F; F(z) is the end point of the segment rF. Let p be 
any set of points; the segment F will then be included in the set 
P = tp if and only if all its points F(7) are in p. Obviously the 
only segment sets P which can be obtained in this way are 
those which are invariant, and all such invariant sets can be 
so obtained. Only the “ core ” py of the point set p is essential 
to this construction; Po consists only of those points x such 
that 7x belongs to p for all 7 (in the interval 0 Sr Sl). Po 
is invariant in the sense that with x all rx belong to pẹ That 
only the core po is essential means that our construction generates 
the same segment set P from two point sets p, p’ if these latter 
have the same core; hence we can restrict ourselves ab initio 
to the consideration of invariant point sets p = pọ. It is extra- 
ordinarily easy to find the point set p which generates a given 
segment set P: we include in p those and only those points 
lying on the segments of $, and this p is automatically invariant. 

If the reader will think through this geometrical illustration, 
which we have formulated here in such a pedantic manner, he 
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will have no trouble in understanding the analogous situation 
for tensors and symmetry elements. A linear sub-space p of t 
is to be called invariant if all elements ax are in p, where x is 
an arbitrary element of p and a is any element whatever.* 
Hence such a is invariant under the totality of correspondences 
of the form 


(a): x> x =ax (2.5) 


On associating this correspondence (a) of r on itself with the 
element a we obviously obtain a representation of the algebra p 
(and therefore of the group ~p); it is called the regular 
representation. (r appears here twice: once as the repre- 
sentation space and again as the algebra p represented in this 
space; the first will be expressed by the German letter r, the 
second by the Greek p. We are here doing the same thing as 
in III, § 2, where we obtained a realization of the group g by 
associating with the element a of g the correspondence s > s’ = as 
of the group manifold on itself.) This regular representation 
supplies us with material from which we can construct all— 
and hence in particular the inequivalent irreducible—repre- 
sentations of the algebra p. When we use the terms invariant, 
irreducible, etc., in r they will always refer to the algebra of all 
correspondences (a) of r on itself, which is simply isomorphic 
with the algebra p of all symmetry elements a. being an 
invariant sub-space of r, we shall always refer to the representa- 
tion induced in p by the regular representation simply as the 
regular representation in p; it associates with each element a 
the correspondence (2.5) of p on itself. The equation x’ = ax 
is, in terms of components, 


x'(s) = Zalr")x(r5). 


Let x be an arbitrary element of p; the requirement that p be 
invariant allows us to conclude that the element x’ defined by 
x'(s) = x(rs) is also in p, where r is any fixed permutation. _ 

Let p be an arbitrary sub-space of r; we say that x belongs 
to the core pa of p if and only if all quantities of the form ax 
belong to p; this pọ is invariant. We thus have the theorem 
that two linear sub-spaces p, p’ generate the same symmetry 
class P = tp = tp’ of tensors if they have the same core. We 
may therefore restrict ourselves ab initio to the consideration of 
invariant sub-spaces p. 


í 


* This “ invariant sub-space ” is not the same as an “invariant sub- 
algebra ” as defined in Chap. III, § 13; to conform with our previous nomen- 
clature it should be called a “ left-invariant sub-algebra.”’ 


- a 
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It is possible that certain relations (2.1) will be satisfied by 
all tensors. Let ty denote the smallest sub-space of r which 
contains the elements F(z,z, - - + 1,) associated with all tensors 
F and all values of the indices (27, ° : ° ty). Then p generates 
the same $ = tp as the intersection of p with t; it is therefore 
natural to restrict ourselves further to the consideration of 
invariant sub-spaces P of ty. These remarks are not applicable 
if the dimensionality = f, for certainly the f! coefficients 


sF(I, 2, eo iS) — F(U, 2" ee SF) 
of the arbitrary tensor F are independent. But the situation 


is different in case n< f: for example, let 8, = +1 according 
as s is an even or an odd permutation; then 


Jò, * SF 


is an anti-symmetric tensor and must therefore vanish in case 
the dimensionality n is less than the order f. 

We can at most hope that conversely p is uniquely determined 
by $ if we restrict ourselves to invariant sub-spaces p which are 
contained in tẹ In order to prove that this is indeed the case 
we attempt to find the inverse process which leads from % 
to p, following the programme outlined by the geometrical 
analogy considered above. In case n =f this is readily done 
as follows: if F is any tensor in $ we let the element 
x = Fil, 2,-- +, f) in t correspond to it; p consists of all the 
elements x so obtained. But in order to obtain a method which 
is also applicable to the case n < f we must alter the procedure. 
We understand by p = 4% the smallest linear manifold containing 
the totality of elements F(i,, ig, © + >, iy) associated with all possible 
tensors F of $ and all possible combinations of indices Ce 
If the tensors E, constitute a basis for P, p consists of all elements 
of the form 

x= 2 & cali . es ty) » Bali 788 ty) (2.6) 
That such a p is invariant has already been shown above, for 
if x = Fli,i, > - + iy) the element x’ defined by x'(s) = x(rs) is 
equal to F(k,k, +--+ ky) where kky». ky are obtained from 
tyt, ** + Zy by the fixed permutation v. 

We now denote the t, introduced above by IR ; it coin- 
cides with the entire space r when n =f. Let the symbol 3 
denote ‘‘is contained in”; the following results then follow 
immediately from the definitions: If p is a linear sub-space of 
r and § = th, then be 3p. If $ is any linear sub-space of 
RF and p= IP, then conversely P 2 tp. We can at most 
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expect that the symbol 3 can be replaced by = if in the first 
theorem p is an invariant sub-space of tọ and in the second if 
$ is an invariant sub-space of Rf. That these converse theorems 
are in fact true under these limitations will be proved in § 4. 


§ 3. Invariant Sub-spaces in Group Space 


We are in need of a fundamental theorem concerning the 
algebra of a group as a preparation for carrying through the 
investigation proposed above; we here prove this theorem for 
a general finite group. However, we do not alter the notation, 
so here m denotes any finite group of order h. 

Theorem (3.1). If p is an invariant sub-space of t there exists 
an element e of the group algebra having the following two prop- 
erties: (1) every element of the form xe belongs to p, (2) every 
element x of p satisfies the equation xe = x. 

In particular (1) implies that e = le itself belongs to p, 
and hence by (2) ee =e; e is idempotent.’ Itis a “‘ generat= 
ing unit” of p in the sense that p consists of all elements of the 
form xe. 

Proof. Let e, ea * * *, €n be a co-ordinate system in the 
vector space t which is adapted to the g-dimensional sub-space 
p in such a way that p is the linear set defined by e, @,,-+-, ey. 
The parallel projection which transforms 


X= 40, +++ ++ xe, into xX’ = xe, te e e H %,e, 


has the two properties (1) it projects every x into an x’ lying in 
p, and (2) within p it is the identity. In the original co-ordinate 
system defined by the simple elements s of the algebra this 
projection is given by 


x'(s) = Sals, t)x(d), 


i 
where the matrix d(s, t) is necessarily of the form 
d(s, t) = e,(s)é&(t) ++ e e + els)é(t) 
and the &(s) are defined by 
2ré,(s)ex(5) = 6, (¢,k=1,2,°++ +, g 
The fact that p is invariant implies that if x is in p then the 
element x, defined by %,(s) == x(rs) is also in p. Consequently 


the projection with the matrix d(rs, rt) has the same two prop- 
erties (1) and (2), where y is any fixed permutation (i.e. element 
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of the group r) whatever. Hence the assertions also hold for 
the correspondence with the matrix 


els, i) = 7 Zdlrs, ri) (3.2) 


obtained by summing over all elements y of the group. This 
matrix satisfies the equation 


e(rs, rt) = e(s, t), 
whence e(s, t) depends only on the combination t's: e(s, į = 
e(f4s). The linear projection 

x'(s) = Le(s, t) x(t) 

t 
may therefore be written briefly x’ = xe, which proves the 
validity of the theorem. 
Let the invariant sub-space p be completely reduced into two 

invariant sub-spaces: p = p, + pz, and let e be the generating 
unit of p. Any element in p can be written as the sum of 


its components in p, and p,; hence in particular e = e, + e,. 
From this it follows that for an arbitrary element x of p 


X = Xe = Xe, + Xe). 
But since x, = xe, is in p, and X, = Xe, is in p}, x, and x, 
are the (unique) components of x in p, and Pa These two 
components for the element e, are obviously e, and 0, whence 


Cê, = Cj, e,e,— 0; 
similarly 
CoC) = 0, CÊ, = Co. 


Hence €,, e, are the generating idempotent units of Di, Pa re- 
spectively; they are ‘‘ independent” in the sense of the 
equations 


eĉ, = 0, ee, = 0. 
On completely reducing p into any number of components : 
-P = £p, the generating unit e of p is decomposed into 
4 
e = dle 
í 
the components of which såtisfy the analogous equations 
ee =0 (i+k), ee: = e, 


The existence of the generating unit offers a means of ob- 


taining a new and simpler proof of the fact that reducibility 
implies complete reducibility : 


INVARIANT SUB-SPACES IN GROUP SPACE 293 


Theorem (8.3). If), Pı are invariant and Yı -3 Y, then p can 
be reduced into Yı +- Pa in such a way that P, is also invariant. 

Proof. Let e, be the generating unit of p;. We decompose 
every element of p in accordance with the equation 


X= Xe, + (x —- xe). (3.4) 
The first component x, = xe, lies in p,, and the second 
Xo = X — xe, 


runs through a certain linear sub-space Pa of p when x runs 
through all elements of p. This sub-space p, is also invariant, 
for 

aX, = ax — (ax)e, 


as ax is in pif xis. The elements x,, X, of Pı, Pp. respectively 
satisfy the equations 


X6 = X, X = 0. 


From this it follows that the sum of an element y, of p, and an 
element x, of p, cannot vanish unless both y, and x, also vanish ; 
hence p, and p, are independent. To prove this we merely note 
that on multiplying Yı + x, = 0 by e, we find y,e, = y, = 0. 
Equation (3.4) represents the reduction of any element of p 
into its components in p, and pz. 

Any idempotent element e generates an invariant sub-space 
Pe consisting of elements of the form xe. If e,, e, are two 
independent idempotent elements (e,e, = 0, e,e, = 0) then the 
sub-spaces Pı, Pa which they generate are independent, and the 
idempotent element e = e, + e, generates p=), + Pa An 
idempotent element e is said to be primitive if it can only be 
expressed as the sum of two idempotent elements e, + e, if 
one of the summands is 0 (and the other e). In order that p, 
be irreducible it is necessary and suficient that e be primitive. 

Obviously any idempotent element e, in particular the 
modulus 1 of the algebra, can be reduced into the sum of 
independent primitive idempotent elements. For if we have 
a reduction into independent non-vanishing idempotent elements 


e = 6; +e, t+: +++ en 


and if, for example, e, is not primitive, it can be further re- 
duced to the sum of two independent non-vanishing idempotent 
elements e,’ + e,” ; in this way we obtain a complete reduction 
of e into m + 1 independent terms, for we have, for example, 


ele, = €,€,e, = 0; similarly e,e, = 0. 


SE enc natn emma) se EVES TNE Te eS 
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This process must certainly cease after at most A steps. Our 
analysis allows us to assert that we thus obtain a complete 
reduction of p, into independent irreducible sub-spaces. 

We have seen that the theorem concerning the complete 
reducibility is a consequence of the existence of a generating 
unit. But the converse is also true: If p appears as a summand 
in a complete reduction t = p + p’ of our given algebra r, then 
it possesses a generating unit. We need only to specialize the 
considerations developed above by applying them to the modulus 
1 of r; 1 can be completely reduced into the two components 
e + e’ lying in p and p’, and the generating units of p and p’ 
are e and e’ respectively. 

The mathematician will find it worthy of note that all these 
considerations are still applicable when the algebra is defined 
over any field whatever. Instead of dealing with the continuum 
of real or complex numbers, as in analysis, we may in abstract 
algebra operate in an arbitrary field, i.e. a domain of elements, 
called numbers, in which the two fundamental operations of 
addition and multiplication and their inverses, subtraction and 
division, are defined in accordance with the formal laws of 
ordinary arithmetic. Our development depended only on these 
rules of operation—with a slight restriction. There are fields in 
which a definite integer, say h, times any number of the field 
yields zero; we may say that k annihilates. Such ‘‘ modular ” 
fields must be excluded, for we wish to retain the possibility 
of finding a number such that its product with / is any given 
number. When our reasoning involves no more restrictive 
assumptions concerning the number field, we are operating in 
a relatively elementary theoretical domain. However, such 
theorems as the ‘‘ fundamental theorem ” III, (10.5), and that 
of Burnside-Frobenius-Schur, which depend on the fundamental 
theorem of algebra, belong to a deeper layer. These theorems 
hold only in ‘‘ algebraically closed" number fields, in which 
any algebraic equation (with coefficients in the field) is soluble. 
Finally such concepts as “ Hermitian,” “ unitary,” etc., involve 
the transition from a number to its conjugate complex and 
have no place in general abstract fields. Our earlier proof of 
the theorem of complete reducibility was obtained with the 
aid of such tools foreign to the general concept of a field. 

Theorem (3.5). A similarity projection x + x' of the invariant 
sub-space p on the invariant sub-space p' is necessarily expressed 
by an equation of the form x'=xb. (In particular, when p 
and p’ are equivalent this theorem is applicable to the one-to-one 
similarity correspondence p $ p’.} 

Proof. Let the given similarity correspondence send the 
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generating unit e of p over into b. 

xe then goes over into x’ = xb, wh 

but for such an element xe = x, 
Additional remark. The projection sen 

eb = b. . On the other hand, if e’ is the 

p’, then since Ð is in p’ we have be’ =b: 

b = eb = be’ = ebe’. 
We express this result, i.e. that b is of the form exe’, by saying 


b has the character (e, e’). Our considerations show that such 


a projection can always be expressed in terms of a unique 
element b of character (e, e’). 


In virtue of the similarity 
ere X 1s any element in p; 


ds e into eb; hence 
generating element of 


If we are operating in the field of complex numbers, with which the 
investigations of analysis (e.g. the theory of functions) deal and in 
which we are exclusively interested in quantum theory, we may supple- 
ment the theorem (3.1) concerning the existence of a generating unit e 
in an invariant sub-space p by the following: 

The generating unit may be so chosen that it is veal; it is then deter- 
mined uniquely by y. 

To prove this we choose as the basis e, e, ..., e, of p a unitary- 
orthogonal system of vectors ; then 


Le ,(s) é;,(S) = Ò ik (i, k, =R l, 2, ee + 8). 
8 


In constructing d(s, t), which we now denote by e(s, t), we may therefore 
choose %; = @;: 


g r 
é(s, 2) E Zels) RUF (3.6) 
I assert that the equation 


e(vs, rt) = e(s, £) (3.7) 


is automatically satisfied—it is no longer necessary to take its mean 
value as in (3.2), The element e defined by e(/-1s) = e(s, 7) is then the 
real generating unit of p. 

In order to establish the validity of (3.7) it is only necessary to 
note that e(s, t) is independent of the particular unitary basis e, ê, 
..., e, chosen; for on going over to a new unitary basis €}, €,,.. «, 
e by a unitary transformation U the bilinear form (3.6) remains in- 
variant. Now in particular the equation 


eils) = eils), 


in which 7 is a fixed element of the group, defines a transition to a new 
unitary basis. l l 

To prove that this real generating unit e of p is unique, assume there 
exists a second, e’; then all elements x of p satisfy the equations 


xe = XxX, XE = X. 


On applying the first equation for x = e’ and the second for x = e we 
have 
e'e = e, ee =e. 
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But since e and e’ are both real, the first of these results yields, on 
going over to the Hermitian conjugates, 

ee’ = ë, 

i the previous result we conclude that e’ = e. 

a adler ee Bend A the content of theorem (3.3) can be extended 
and its proof simplified. If e, e, are the real generating units of p, p, 
respectively, then since e, iS in p €; = &,, and on going over to the 
Hermitian conjugates we find ee, = e,. Hence the idempotent element 
e, introduced by e = e, + eis real and independent of e,; = p + py, 
is thus completely reduced into p, and an invariant sub-space p, which 
is unitary-orthogonal to p, and which has as its real generating unit e,. 


§ 4. Invariant Sub-spaces in Tensor Space 


We now return to the investigation of tensors of order f, 
the totality of which constitutes the space R. Let a again be 
the group of all permutations of f things and r ( = p) the corre- 
sponding group space (algebra). Let a be asymmetry quantity, 
i.e. an element of the algebra p, with components a(s); the 
element ĉ is then defined by 


&(s) = a(s-4) (4.1) 
ea Val 
which asserts that the tensor F’ is obtained from F by the 
operator a, is equivalent to the equation 
F'=F-& 


between the corresponding elements F and F’ of the algebra p. 
For 


The relation 


sF = gát”) stF 
t 


is in fact obtained from 
F' = Falt): tF = Våt’) tF 
¢ t 


by operating on it with the permutation s. 

In the following considerations, which are concerned with 
symmetry classes of tensors, p (with or without index) always 
denotes an invariant sub-space of r, ê the generating unit of 
p and P the corresponding #p. We may then say that e is 
the generating idempotent operator of the symmetry class P in 
the following sense : 

(1) eF lies in 8, F being any tensor whatever ; 

(2) if F isin $ it is reproduced by the operator e: eF = F. 

In this way we obtain a constructive definition of the symmetry 
class $ as the totality of all tensors of the Jorm eF. This definition 


i 
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is considerably simpler than the original one in terms of p, for 
it depends on a single element e instead of a manifold p. If, 
for example, we are dealing with the class $ of all completely 
symmetric tensors 


l 
FAs 


is such an operator; the corresponding operator for the class 
of all anti-symmetric tensors is the alternating sum 

1 

= D OSS. 

| fit 

Theorem (4.2). If P 3p or p=p, + be, we have W 3 P, 
P = Pı + Po respectively. 

We need to prove only the latter part of this theorem, 
i.e. for the case of complete reduction. The generating unit 
ê = @, + ê, of p has as components @,, @, in p,, Pz the generating 
units of P4, Pa respectively. The formula 

eF — e,f +. e,F 
defines the corresponding complete reduction of 38 into the 
independent invariant sub-spaces $, $a. 

Theorem (4.3). If Py ~ Pa then By ~ Bo. 


The similarity correspondence x,—> xX, of pı on P: is, by 
theorem (3.5), of the form 


A S, 
X, = X, b, Xļ =X b. 


Hence 


define a one-to-one similar correspondence of P, on P, and its 
inverse. | 

Theorem (4.4). If p -3 to then p = UR. 

The only non-trivial part of this first converse theorem which 
remains to be proved is that p 3 I}. All tensors of the form 
F, = eE, are in %, where (Ew) is a basis for the entire tensor 
space Jt’; hence all elements of the form 


Y = Lealty > > e ty) Felt, + + + 44) 
a, 
are in IB. On introducing 
X = Jie(t, °° * ty) Balt, + + * ty) 


a, ï 
we have y= xê. On recalling the definition of t, = IR we 
see that xê belongs to Mp if x lies in rẹ But in virtue of the 
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assumption that p -3 Tọ this is automatically satisfied if x is an 
arbitrary element of p; but then xê = x. Hence every element 
x of p is contained in 9%. 

In order to formulate the converse of these theorems let 
P (with or without index) now denote an arbitrary invariant 
sub-space of R/ and p the corresponding 4%. 

Theorem (4.5). If P 3 8 or B= B+ Be, then p’ 2 p, 
p = Pı + Po, respectively. 

Theorem (4.6). If P ~ P thenp ~p. 

Theorem (4.7). $ = Zp. 

The last theorem is by far the most important of all; it 
asserts that every P is a symmetry class of tensors. It is desirable 
to prove it first, i.e. to prove that fp -3 $. Let ê again denote 
the generating unit of p; tp then consists of all tensors of the 
form F’ = eF. Since the element @ belongs to p it is necessarily 
of the form 


é(s) = e(s73) = Zealhy "++ Ry): SEalky +++ Ry), (4.8) 
where the tensors Ex constitute a basis for the space P. Now 
the trivial equation 

scli, e. i) esFli ++ + 4) = Zeli, +++ ip Flip + + iy) 
shows, on replacing sc by c, that 
Zeli es ty) SFli + i) = Zs Melty vee i) Flay e+ > ty), 
Hence we may replace (4.8) by 
e(s) = Estalki -e + Ry)+ Ey(ky ++ + Ry) 
and the coefficients of F’ are then given by | 
F'li ++ + 4,) = Xiali ss ip; ky >> + ky)Ealki t+ k 


where 
Calia °° * ty; ka> + + Ry) = SsF(i, + + + iy) t Seah, © + + ky). 
8 


Because of the summation over all elements s of the group m 
this transformation with coefficients c, is symmetric; hence 
the assumption that the sub-space $ is invariant allows us to 
conclude that F” lies in P if the E, do. But this establishes 
our theorem. 

The theorem can also be proved directly, without calling 
on the theorems of § 3, in the following way. That F is in 
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tp means that F(i,ta © * - i+) is in p and is consequently of the 
form (2.6) : 


Fli +: ip) = Zbalty tee des Rye s+ Ry) + Bulky + + Ry). 


The E, constitute a basis of §8. Writing down the s- component 
of this equation and replacing the indices ty, +++, ty by iget", ty 
we find the equation 


Fig ++ i) = Zs bain + + tps ka + + By) Bally + + + By) 


for the components of F. Since this holds for every permutation 
s~1 we may sum over the elements of the group and obtain 


Fit, e ty) = Z Calia n. ly; kyt: ky) Ealky ->e Ry), 
where the coefficients 
Colt * ty; Ry s+ Ry) = = FE Sbali st ty; Ry st Ry) 


are symmetric. Hence since the E„ belong to the invariant 
sub-space P and F is obtained from them by a symmetric 
transformation, F also belongs to $. 

The only part of theorem (4.5) which is not self-evident is 
the assertion that Pı, Pz are independent. By theorem (4.7) we 
have the relations 


ip 2 4p, 3 Bo Bt 3 P 


for the (invariant) intersection p* of p, and p, But since 
%,, Pa are independent it follows that Zp*, and therefore p*, 
is empty. 

Theorem (4.5) shows the $ associated with an irreducible p 
is also irreducible. Hence it follows, in particular, that the 
manifold of symmetric and the manifold of anti-symmetric tensors 
are irreducible and invariant, not only with respect to the algebra 
of symmetric transformations, but also with respect to the 
transformations induced in tensor space by the affine or unitary 
groups of transformations in the vector space R. Applying 
this to the 2-dimensional vector space, we see that the repre- 
sentations ©, of ¢ = c, or u constructed in III, § 5, are irreducible. 

In order to prove (4.6) we must first examine the nature of 
to (for »</) in some detail. We call the component a(l) of 
an element a of the algebra the trace of a. Hence the trace 
of the product ab, which we call the scalar product tr(ab) 


of a and b, is 
tr(ab) = D’a(s)b(s~4). 
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The trace of a is then tr(al) = tr(la) = tr(a). The scalar 
product is obviously symmetric in a and b, and the symmetric 
bilinear form tr(ab) is non-degenerate, i.e. a = 0 is the only 
element for which the equation tr(ax) = 0 is satished identically 
in x. 

Auxiliary theorem (4.9). to is a left- as well as right-invariant 
sub-algebra of xr. tr(ab) is non-degenerate within to, t.e. the only 
element a of tọ whose scalar product with every element X of t, 
vanishes is a == 0. 

The first part of this theorem is almost self-evident. For 
if x = F(i,-++1,), the element x’ defined by %’(s) = x(sr) is 
F'(i, + + + ty) where F = rF. 

Let i be the generating unit of to, 2 an element of tọ and 
x an arbitrary element. Then since fọ is right-invariant ax 
is also in ty, whence 


ax = ax:i, tr(ax) = tr(a: xi). 


Now xi is in tọ; hence if the scalar product of a with every 
element xi of t, vanishes then tr(ax) = 0 without restriction on 
x. It therefore follows that a = 0, as asserted. 

Proof of theorem (4.6). Let E, be a basis for $, and let the 
similarity correspondence of P on P’ send E, into the basis E, 
for, P’. Let c,(i,- + + iy) be a given system of coefficients and 
write 


c= Zealty e-e dy) + Baliye > dy) (4.10) 
e = Yeali e.. iy) -Bli e. dy), 


The desired similarity correspondence between p and p’ is naturally 
to be defined by c —> c’. However, this is only possible provided 
two systems of coefficients ¢,(i,; > - + iy) which define the same 
c also define the same co’; or a system of coefficients which 
causes ¢ to vanish must also cause c’ to vanish. 

We first remark that if a tensor F satisfies the equation 


G = Sc(s):sF=0 
then also l 
G’ = Se'(s3)-sF = 0. 
By (4.10) 
c(s*) = X'S ca(ky + + + ky) Ey(ky + + + Ry), 
whence = 


Gin +++ ip) = DE clin + iyi By + + k) Ealka + + > By) 
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where 
Calti .. Pa ky: ky) = Dis (ty ss ty) * Sla(Ry ss ky). 


These c, define a symmetric transformation. Hence the given 
similarity transformation $ -> W, which sends Ex into Eg’, sends 
G into G’. This proves our assertion that the vanishing of 
G implies the vanishing of G’. : 

If c = 0 we then have 


Dic’ (s7!) + SF(ty + + + ty) = trle’s F(z, + + + i] = 0 


for all tensors F and all combinations of indices 2, + + + dy, or 
tr(c’x) = 0 for all elements x of to. Hence by the auxiliary 
theorem (4.9) c’ = 0. 

The result of our investigations is that there exists a one-to-one 
correspondence between the invariant sub-spaces P of to and the 
invariant sub-spaces $ of Rİ. This correspondence is as close 
as possible; irreducibility, complete reduction, equivalence and 
inequivalence on the one hand imply the same on the other. In 
particular, we emphasize the further consequence : 

Theorem (4.11). Every invariant sub-space P of Rİ, in 
particular Rİ itself, can be completely reduced into irreducible 
invariant sub-spaces. 

I hope that our elementary methods have made this corre- 
spondence quite apparent. 

It is evident a priori that we can completely reduce the 
modulus 1 of the algebra p into a sum e, + e, + °+- + e,, of in- 
dependent primitive idempotent elements. The formula 


F=e Fre PF+:+++:+e,f 


+ 


then gives the complete reduction of R into independent in- 
variant sub-spaces P, Pa, °° +, Pm, each of which is generated 
by one of the idempotent operators e. ($, consists of all tensors 
of the form e,F.) From this point of view we might consider 
as the only non-trivial result of our investigation the assertion 
that the P generated by a primitive e is irreducible (with respect 
to the algebra X of all symmetric transformations). Physically 
this means that the class of terms.corresponding to such a $ 
cannot be further divided into parts which cannot under any 
conditions interact with each other. If in spite of this there 
does exist such a decomposition it is accidental—i.e. attributable 
to the special dynamical situation in the case in question. 
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§ 5. Fields and Algebras 


We here interrupt our development in order to present an 
axiomatic treatment of the two fundamental concepts field and 
algebra ; our investigation has revealed the importance of these 
concepts for quantum theory. The physicist who is not par- 
ticularly interested in such a treatment may well omit these 
sections. 

A field is a domain of elements, called numbers, within 
which the two operations of addition and multiplication are 
defined and which associate with any two numbers œ, B of the 
field certain unique numbers «+ f, aß respectively. Addition 
obeys the commutative and associative laws 


atB=Bra, («+f)ty=a+ (6+ y) 


and has a unique inverse, subiraction. From this follows the 

existence of a unique number o (zero) with the property 

a--o=o-+a=a for all a Further, associated with each 

number a is a number — a, its negative, such that a + (— a) =o, 
We require that multiplication obey the associative law 


(aB)y = a(By) 


and the distributive laws 


(x + B)y = (ay) + (By), alB + y) = (aß) + (ay) 


with respect to addition. From the distributive law follow 
the relations 


xo = ox = o, 


Multiplication need not be commutative ; in case it is we speak 
of a commutative field. Further, division by any number 
other than o shall be possible and shall lead to a unique quotient, 
i.e. each of the equations 


af = 8, na=B 


have for given « +o and given B one and only one solution 
€, 9 respectively. From this it follows that the product af of 
two numbers can only be o if one of the two factors js o. Asa 
further consequence, there exists a number €, ““ one ” or ‘* unity,” 
with the property that 


QE = Eg = Qa 


for all a. We explicitly assume that not all numbers equal o: 
then in particular € +o. Every number « +o possesses a 
unique reciprocal x`! with the property «a™l = a`la = e. 
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We must introduce in addition to the numbers of our field 
the ordinary numerical symbols 1, 2, 3, - + +. Their inter- 
pretation as multipliers is given by the equations 


læ = a, 2 = a -+ a, 3% = (2a) +H a, 


in general 


3 


(n + l)a = (na) + a. 
In particular we can construct the series 
le, 2e, =» SEELE (5.1) 


of multiples of e. We then have two possibilities. (1) All the 
numbers of this set may differ from e€ ; then they are all different, 
and we can conclude with the aid of the equation 


np = ne- R 


and the division axiom that for a given number « there exists 


ne 


? 


P o4 . . . 
one and only one number 8 = z which satisfies the equation 


nB = a; we can then introduce ordinary rational numbers as 
multipliers. (2) The second possibility is that one of the multiples 
in (5.1) is equal to e itself; let the least multiple of this kind be 
pe. Then the numbers of the series (5.1) repeat in cycles of 
length p. p must be a prime number, for if p were the product 
of two integers m, n smaller than p we would then have 


o = pe = ME° NE, 


but by assumption neither me nor ne are o, for pe is the lowest 
multiple of this kind, and this is contrary to the division axiom. 
In this case we are dealing with a finite field of modulus p.* 

In order not to lose ourselves in too broad generalities we 
now take as our number domain a commutative field and define 
a linear associative algebra of finite order over this field. 
By number we mean the elements of the field, and denote its zero o 
and its unite by 0 and 1; by element we mean an element of the 
algebra. We denote the former by small Greek and the latter by 
small Latin letters. An algebra is characterized by three fundamen- 
tal operations: addition of two elements, a+b; multiplication of 
an element by a number, ya; multiplication of two elements, ab. 
The first and second of these operations obey the familiar axioms 
of vector calculus (I, § 1), which we set forth here again for the 
sake of completeness. 

Addition is commutative and associative and has a unique 
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inverse, subtraction. 


It then follows that there exists a null- 
element o. 


Multiplication by a number obeys the laws 


la a, a(Be) == (aP)c 


(x Hee (ae) + (Be), alb + c) = (a8) + (a). 


The order h is introduced by the dimensionality axiom: every 
h | ft elements of the algebra are lincarly dependent, the co- 
eHicients in the equations expressing the dependence being 
numbers of the field) but there exist k linearly independent 
elements, A set of k such elements €i, E2, °° *, én, called " basal 
units, form a basis for the algebra in the sense that any clement 
a can be expressed in one and only one way in the form 


A Kyl y fs Kyg bs sb One, 
and can be replaced by the set (a, €a °° 
ECD POTROT ES, 


Multiplication of elements among themselves obeys the 
distributive laws 


(a | be (ae) t (be), cela + b) = (ca) + (cb) 


ter both factors and the associative laws 


ya b: y(ab), b+ ya = ylba), 
(ab)c == a(bc) 


We neither assume that multiplication is commutative nor 
that it possesses a unique inverse, division, But we do assume 
that the aluebra possesses a “one,” the modulus (or principal 
at. ie, an element e with the property ae = ea =a for all 
elements a. We shall usually not hesitate to denote the zero 
and one of the elements of the algebra by 0 and 1. 

If we assume the possibility of division the algebra reduces 


tea fin gencral noncommutative) field or division algebra ot 
umir order A over the given field. 


+, %) of h numerical 


§ 6. Representations of Algebras 


bir the sake of the printer and in order to give the text a 
more pea tiul appearance We no longer emphasize the elements 
nf our algebra by expressing them in boldface type. . This 
apples in particular to the elements of the algebra p of “ sym- 
metry quantities "which we may often denote by this latter 
eepresdant in vase of possible confusion with the elements of 
the underlying group. We still employ this means of distinguish- 
mig between the tensor # and the symmetry clement F or when 
we wie to consider an element as an operator acting on a tensor. 
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We start with an algebra p of finite order h, the elements of 
which constitute an k-dimensional vector space t, and associate 
with the element æ of p the correspondence 


(a): x—->x' = ax 


of r on itself. We consider the algebra (p) of transformations 
(a), which is simply isomorphic with the algebra p, as funda- 
mental for the vector space r, i.e. the term reducible, invariance, 
etc., as applied to sub-spaces of r are with respect to the 
group of transformations (a). We assume that t can be com- 
pletely reduced into irreducible sub-spaces Pı + Pa +--+; each of 
these sub-spaces then contains an idempotent generating unit 
€;,€,,°°*. We have already seen that this assumption is true 
for the algebra associated with any finite group—at least under 
the restriction that the field over which the algebra is defined 
does not have as modulus a prime number which is a factor of 
the order 4 of the group. 

We discussed the representations of a group or of the corre- 
sponding algebra in Chapter III. We found that the irreducible 
representations are subject to certain important conditions 
which, surprisingly enough, limit their number and which, 
together with the as yet unproved ‘‘ completeness theorem,” 
lead to the reduction of the given algebra into independent 
simple matric algebras (III, § 13). That we were unable to 
prove the completeness theorem with the methods there em- 
ployed was to be expected, for we assumed that the representa- 
tions were given and examined their properties; we had no 
general process for the construction of representations of the 
given algebra. But we are now in possession of the materials 
for such a construction: the reduction of t into irreducible 
sub-spaces p; reduces the regular representation into as many 
inequivalent irreducible representations of our algebra as there 
are inequivalent invariant sub-spaces p, We shall now carry 
out this construction process to the point of obtaining the re- 
duction of our algebra into independent simple matric algebras ; 
it will be desirable to derive the previous results again from this 
standpoint. A further difference between this investigation 
and that of Chapter ITI consists in the fact that we here refrain 
as long as possible from placing restrictive assumptions on the 
commutative field over which the algebra is defined; only at 
the end of the investigation do we discuss the advantages at- 
tributable to the fact that the continuum of complex numbers, 
the only field in which we are interested for the physical appli- 
cations, is algebraically closed. 
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Theorem (6.1). Every representation of the algebra p is com- 
pletely reducible into irreducible representations. Each of these 
irreducible constituents is equivalent to the representation induced 
in some Y, by the regular representation. 

(Hence the complete reducibility of the given algebra implies 
the complete reducibility of its representations. Further, every 
irreducible representation is contained in the regular repre- 
sentation, which therefore constitutes an appropriate starting 
point for obtaining all representations by the method of reduction). 

Let H be an n-dimensional representation, and let €,, êa °°°, 
€a be n fundamental vectors constituting a co-ordinate system 
in the representation space R of H. If the element a of the 
algebra corresponds to the linear correspondence A in Q, we 
interpret the equation 


y= ag as r= Af, 


where g’, r are vectors in R. Ife is a given fixed vector and x 
runs through all elements of one of the irreducible invariant 
sub-spaces p= p, of r then, as we shall show immediately, 
xe runs through a certain sub-space p(e) of R which is invariant 
with respect to H. Indeed, the transformation A associated 
with an arbitrary element a sends xe over into (ax)e, and if 
xis in p, axis also. p(e) is either 0 or is similar to p in the sense 
that different x generate different images xe, for those x of p 
for which xe = 0 constitute an invariant sub-space p’ of p, and 
in virtue of the assumption that p was irreducible p’ must 
either be 0 or p itself. Hence if p(e) +0 the representation 
induced in p(e) by © is equivalent to the regular representation 
in p. 

These considerations are to be supplemented by the following 
remark. If% is any invariant sub-space of R then p(e) is either 
independent of $ or is contained entirely in $, for those elements 
x of p for which xe lies in Ẹ constitute an invariant sub-space 
of p, which is therefore necessarily either 0 or p itself. 

Now construct successively 


Plex), Pele) ° °°, 
Pi (Es), Do(@s), sey 


Palen), | Palen), 


Each sub-space in this list is either entirely contained in the 
sum of the previous ones or is independent of this sum; on 
retaining only those sub-spaces for which this latter possibility 
is realized we obtain a reduction of R into certain invariant 
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sub-spaces ),(e,). To prove this theorem we need only to note 
that the sum of the sub-spaces contained in the first row con- 
tains at least the vector €, that on adding to them the sum of 
those contained in the second row we obtain at least the vector 
ê, in addition, etc.® : 

The theorem just proved is in particular applicable to the 
symmetric group 7, and we now wish to establish the analogue 
for the algebra X of symmetric transformations in the space / 
of tensors of order f. We already know that Rf can be reduced 
into sub-spaces $; which are irreducible with respect to 2 
(provided the number field over which 2 is defined does not have 
as modulus a prime Sf). Every transformation A of & is at 
the same time a transformation A, of $; on itself and the corre- 
spondence A— A, is naturally a representation of 2, the 
‘ representation induced in $, by the algebra 2.’’ We wish to 
show that the representations of 2 are completely reducible 
into irreducible constituents, and that each of these constituents 
is equivalent to the representation induced in some $, by the 
algebra 2. Naturally this does not follow immediately from 
theorem (6.1); in order to establish the connection between 
the two we must show that the complete reducibility of R’ into 
irreducible invariant sub-spaces P; implies the same for the 
algebra 2. We apply the notation and conventions given at 
the beginning of this section to the algebra 4: (A) is the 
correspondence 


S—> S = AS 


of the ‘‘ vector space’’ 2 on itself, A -> (A) the regular repre- 
sentation of X; the algebra of transformations (A), which is 
simply isomorphic with Æ, is taken as fundamental in the vector 
space X, i.e. the transformation group of 2 consists of the 
transformations (A). 

Theorem (6.2). Let E be an algebra of transformations in a 
vector space R, and let R be completely reducible with respect to 
this system & of transformations into irreducible invariant sub- 
spaces B;. Then X is itself completely reducible into irreducible 
invariant sub-spaces II;, and the representation induced by the 
regular representation in II, coincides with (more precisely, 1s 
equivalent to) the representation induced in one of the irreducible 
P, by the algebra X itself. 

This theorem holds without any restrictidns on the field 
over which 2 is defined. Let JZ be an irreducible invariant 
sub-space of » (consisting not merely of the transformation 0), 
and let R+0 be a transformation of M. There then exists 
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a vector a in R such that Ra +0. Let a be decomposed into 
its components a, in the various sub-spaces §8,; at least one of 
these components, say a, = e, must be carried over into a vector 
Re +0 by R. We now hold e fixed and let S in 3 = Se run 
through all transformations of JI; these 3 then constitute an 
invariant sub-space J(e) of P = P, The “typical reasoning ” 
already applied in the proof of the previous theorem then allows 
us to conclude that : . 

(1) J(e) is either 0 or $, as $ is irreducible; in this case 
it is necessarily $, for the vector Re = 0 belongs to M(e). 

(2) S=0 is the only transformation in JZ which sends e 
over into 0, for those S of II for which Se = 0 constitute an 
invariant sub-space of the irreducible sub-space JJ. Hence 
38 = Se sets up a one-to-one correspondence between JI and $. 

This correspondence is similar, for S’ = AS implies that 
the vectors 8 = Se, 3’ = S'e satisfy the equation 3’ = 48. We 
have thus proved the second part of our theorem: the repre- 
sentation induced in IZ by the regular representation coincides 
with the representation induced in $ by the algebra itself; 
briefly, J is similar to some §;. 

Since Se runs through the entire sub-space P when S$ runs 
through H there exists an Æ in I such that Ee = e; then 
E?e = e. Since the transformations E and E?! of JZ both 
associate the same image with e they are identical: E is idem- 
potent. Hence 2 can be completely reduced into two inde- 
pendent sub-spaces JI + Z” in accordance with the formula 


S = SE + (S — SE). 


[Cf. the proof of Theorem (3.3).] Successive application of 
this procedure leads to the complete reduction of 2 into its 
constituents IZ. 

Having proved Theorem (6.2), we obtain from Theorem 
(6.1), under the same assumptions, the further theorem : 

Theorem (6.3). Every representation of X is completely 
reducible into irreducible representations. Every irreducible re- 
presentation of & coincides with the representation A-—» A, 
induced in some $, by the algebra = itself. 

Theorem (6.1) yields the further (rather uninteresting) fact 
that not only is every JI, similar to some $, but also conversely 
every $; is similar to some JJ,. 

As has already been indicated, all of these results are applic- 
able to the algebra of symmetric transformations in tensor space 
R. But we have shown in § 1 that this algebra can be replaced 
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by the group (¢)J induced in tensor space by the group ¢ of 
linear transformations 7 


? 
x; epee 


IMs 


a(ik)xr [det [a(zk)] + 0] (1.3) 


of n-dimensional vector space, i.e. by the representation (c)F of 
c. We shall say that a representation of ¢ is of order f if the 
components of the matrix A, which corresponds to the element 
(1.3) of the group, are rational integral functions of the a(tk) 
of order f. Our theorem then asserts : 

Theorem (6.4). Every ft order representation of ¢ is com- 
pletely reducible into irreducible representations, and every irreduc- 
ible representation of order f of ¢ is contained in the representation (c)f. 

This theorem. is still valid on restricting the affine group ¢ to 
its unitary sub-group u. (Naturally the concept “ unitary '’ im- 
plies that we are then no longer dealing with an arbitrary field, 
but are operating in the field of all complex numbers.) 


§ 7. Constructive Reduction of an Algebra into Simple 
Matric Algebras 


We again assume that the algebra p of order h, which may 
at the same time be considered as a vector space t of k dimensions, 
is completely reducible into irreducible invariant sub-spaces p,. 
The generating units e; of these irreducible p, are obtained by 
the corresponding reduction of the modulus; we can then 
express an arbitrary element x of r as the sum 'of its components 
in the various p; : | 


l= Oe: (e; in b,), x= L Xli (7.1) 


If q is a sub-space of r we denote by qa the totality of elements 
of the form xa where x runs through all elements of q; e, with 
or without index, is an idempotent element, usually primitive ; 
p = te the invariant sub-space generated by e; § the repre- 
sentation of p induced in p by the regular representation. 

We could consider in addition to the reduction (7.1) of t 
into left-invariant sub-spaces the analogous reduction into 
right-invariant sub-spaces by means of the equation 


x= D> eX. 
t 


But the most complete separation into mutually independent 
components is obtained by carrying out both of these processes 


simultaneously : 
a= Rieter == Atir (7.2) 
t, t 
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The elements of the form eve, are those of character (e,, ex), 
or briefly (ik). Let py, be the sub-space consisting of all elements 
of this character. The various P,» are independent and the 
entire r is reduced into the sum of the Pir; the original left- 
invariant Pa =2p,,. The important properties of p,, are given 


by the following : l l 

Auxiliary Theorem (7.3). I. If p, p are two inequivalent 
irreducible sub-spaces with generating units e, e', all elements of 
character (e, e’) are = 0. l N 

II. The elements of character (e, e) constitute a field or division 
algebra which is simply isomorphic with the system of similar 
projections of p on itself. f 

Proof. I. Let a be any element of character (e, e’). The 
transformation 

[a]: x —> x’ = xa (7.4) 


carries every element x of p over into an element x’ of p’ and 
defines a similar projection. Conversely, we know that any 
similar projection of p on ’ is defined by an equation of this 
form, and that the generating element a of character (e, e’) is 
uniquely determined by the projection. If p and p’ are irre- 
ducible our “ typical reasoning’’ leads us to the two usual 
alternatives : either the projection associates with every clement 
x of p the image x’ = 0 or it defines a one-to-one correspondence 
of ponp. The equation ea = a tells us that the first alternative 
is possible only if a= 0, and the second implies that p and p’ 
are equivalent. 

II. The above remarks are applicable to an clement a of 
character (e, e) and the similarity projection of p on itself which 
it generates. If p is irreducible every such projection, except 
the one defined by a= 0, is one-to-one and consequently has 
an inverse. But the existence of an inverse is identical with 
the possibility of division. The isomorphism asserted in the 
theorem is apparent on reversing our usual procedure, and 
reading the resultant of two or more correspondences from 
left to right, for the resultant of the correspondences 

x’ = XA, x = xa’ 
is given by 
x” = glaa’). 


We now proceed with the help of this auxiliary theorem as 
follows: Arrange the p; into classes of equivalent sub-spaces 
with generating units 


é Ss e œ Ps ” ” 
ĉi, » êri êt G) 
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and add together the generating units in each of these classes : 
e +: . "Hee =E, eat: . oe ee 
We then have 
Lee pe” peee (7.5) 
rere: (7.6) 
where tr’, t”, +- denote the inequivalent sub-spaces te’, te”, -> 
into which t is reduced. 
Part I of the auxiliary theorem above then tells us that, 
for example, 
exe” = O. 
Hence the product a'a” of two elements belonging to different 
sub-spaces t’, r” is always 0, and the reduction 


a=ata’ t+ ++ sae tas’ +: 
leads to the multiplication rule 
ab = a'b' + ah" 4- e. 

From this it follows that r’ is both right- and left-invariant and 
a fortiori constitutes an algebra p’ (“ invariant sub-algebra’’) ; 
e’ is the modulus of p’. The given algebra ts then the direct sum 
of the simple algebras p', p”, +, where the precise meaning of 
direct sum is defined by the following : 

Let p’, p”, - be algebras (defined over the same field), and 


consider as the elements of a new algebra p, the direct sum of 
p’, p”, ++, all sets 


a= (af, a", >>) 
consisting of an arbitrary element a’ of p’, an arbitrary a” 
of p’,::+. The fundamental operations in p are defined by 
(a’, a’ o. -) + (D’, b", . +) = (a’ -- b’, a" + b”, . -), 
: Na’, as) == (a; ha", SREO a 
(a’, a”, e. a (b, b”, e. J) = (a’ bt a'b, >. 2) 


where A is any number. 

Note that the central of the algebra p obtained by direct 
summation is the direct sum of the centrals of the individual 
algebras p’, p”, 

We investigate in detail one of these simple sub-algebras, 
say p’, which we now denote simply by p; its modulus e’ 
may now be denoted by 1. On omitting the primes, the de- 
composition of 1 into equivalent primitive idempotent elements 
e; is expressed by 


L=eqtet-- +e, 
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Every element a of p is reduced in accordance with the formula 
(double Peirce reduction) 


T 
a= Xai = (ees) 
ik= ik 

into components of characters (ik). The component cix of the 
product c = ab is easily seen to be expressed in terms of the 
components a;r, bix of a and b by the equation 


f 
Cik = P% A:jD ix: 


We have thus already obtained the connection between our con- 
siderations and the matrix calculus. 

The invariant sub-spaces p,, Po, °° ‘, Pr generated by the 
é1, €>, * * *, €, are all equivalent. Let p be any of these classes, 
e.g. p = p, and let I’; be any fixed one-to-one similarity corre- 
spondence of p; on p. In accordance with (7.4) any element 


a= Ay, = €1de, 


of character (e; e,) generates a similarity projection [a] of p; 
on P; this projection can be written in the form 


[a] = Poly (7.7) 


where « is a similarity projection of p on itself. But by Part II 
of the auxiliary theorem proved above the similarity projections 
of p on itself constitute a field (division algebra) ® which is simply 
isomorphic with, the set of elements of character (e, e). If @ is 
of order v each of the r left-invariant sub-spaces 


Pr = Spa 
i=] 


is of dimensionality g =r'v. The number of times r an irre- 

ducible representation occurs in the regular representation is 

accordingly a factor of the dimensionality g of the representation. 
Any element a can be reduced into its components a, 


which may be any elements of the independent sub-spaces Piz- 
In accordance with (7.7) 


[tir] = Dol r (7.8) 


and a; may be replaced by the corresponding element œ; of 
the field ® Since conversely any such element «,, is by (7.8) 
associated with a similarity projection [a,,] of þ; on Pr, and there- 
fore with a definite element a,, of character (ik), we obtain 4 


i. Ae -a  - 


REDUCTION OF AN ALGEBRA . 313 


one-to-one reciprocal correspondence between the totality of 
all elements a of the simple algebra p and the totality of matrices 


Xir Xia °° * Oy, 
Xol Azz * * * Aar (7.9) 


[Sry Ara * ° One 


of order r whose components «;, are elements of the field ©. 
The correspondence is such that to the three fundamental 
operations of the one (addition of elements, multiplication of 
an element by a number and multiplication of two elements) 


correspond to the same operations of the other. Note that in 
particular 


CAJA a [a;;] [Bix] =P 506551" ji eae Bid E i 
= I;- AiP Dy. 
We have thus proved : 

_ Wedderburn’s Theorem.’ Any of the simple algehras, whose 
direct sum constitutes the given algebra p, is simply isomorphic 
with a simple matric algebra in a certain field (division algebra) 
Ọ defined over the field of the original algebra. 

(Remark. The invariant sub-space p, consists of all elements 
a such that the matrix ||«,,|| has as its only non-vanishing column 
the k. The element e; is then described by that diagonal 
matrix all of whose components vanish except the one occupy- 
ing the ¿è place, which is 1.) 

It is readily seen that the central of the simple algebra p 
consists of those elements whose matrix (7.9) is of the form 


a 0...0 
0 a:-::9O 
o Ocaescce 


where a belongs to the central of the field ©. 

Our construction was divided into two steps. First t was 
completely reduced into the sub-spaces r’, t”, + + + which are 
both right- and left-invariant and then these were further 
reduced into the left-invariant sub-spaces p; We must now 
return to the consideration of the first step. On multiplying 
xe’ on the left by (7.5) we find 


xe! = exe’, 
and on multiplying e’x on the right by the same factor 
ex == E'xE', 
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Hence 
xe’ = ex; 


the e', e”, +--+ commute with all elements and belong to the central 
of the algebra. The sub-spaces t = p’, t”, +-+ are both right- 
and left-invariant in the sense that neither the transformation 
x' = xa nor x’ = ax leads out of them, and they are furthermore 
irreducible in this respect—indeed, it is for this reason we call 
them “ simple.” In order to show this we proceed as follows : 

(7.10). If ty is a sub-space which is both right- and left- 
invariant then either e; is contained in tọ or tye; = 0. For 
Yo é; is an invariant sub-space of the irreducible p; and is there- 
fore either 0 or p, itself. In the second case we have 


P: = Toe: -3 To 


since Y, is right-invariant ; hence e; is contained in to 

(7.11). If e; is in tọ the same is true of any e which is equi- 
valent to e, For the similarity projection x’ = xb of p; on p 
associates e with some element a; of p; by means of the equation 
e = a,b, and since a, is in Tọ g is also. 

(7.12). If tẹ -3 r then since ty = Xrge; not all the tae; can 

v 

be empty, i.e. one of the e; must occur in tg. But they must 
then all occur in tg, hence also e’ = Sve., and consequently 1, = T. 


(7.13). Again let tọ be a right- and left-invariant sub-space. 
Then either te= r’ or it is empty; in the former case £’ is 
in tẹ It follows from 


To = Tye’ + rye” +: +: 


that t, is necessarily the sum of certain of the spaces t’, t’, +++; 
when in particular ry is irreducible in the sense of right- and 
left-invariance it must coincide with one of the t, t”, =.. 
Hence the reduction (7.6) is unique. This further shows that 
every right- and left-invariant sub-space ty possesses a generating 
unit 2 which belongs to the central of the algebra, and that t 
can be completely reduced into tg and a supplementary right- 
and left-invariant sub-space. 

(7.14), If p is an irreducible (left-) invariant sub-space with 
the generating unit e, then pe’ is invariant, and since pe’ = ep 
it is either 0 or p itselt. Since 


p= pel + pet... 


the equation pe = p must hold for some one of the e’, e”, + =», 
while for all others pe = 0, We then say that e belongs to p 
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and that conversely e or p belongs to e. p is a sub-space of the 
right- and left-invariant te. 

An algebra p = r, concerning which we only assume that it 
is completely reducible into irreducible invariant sub-spaces p,, 
is necessarily obtainable by successive application of the follow- 
ing processes : 

(A) Construction of a field ; 

(B) Transition to matrices: we take as elements the matrices 
of a fixed order r whose components are arbitrary elements of 
the field ; 

(C) Direct summation. 

The processes (B) and (C) are formally completely determined 
and are therefore of an elementary character. Hence the 
construction of algebras is reduced to the construction of fields, 
i.e. of special algebras in which division is possible (“ division 
algebras ”). 

The converse is naturally also true: any algebra constructed 
by the three steps (A), (B) and (C) is completely reducible, for: 

(A) If the algebra r is itself a field, rt is itself an irreducible 
sub-space of r. For if « is any non-null element of the field 
then æ runs through the entire field with é; this is merely 
the content of the division axiom. 

(B) The matrices (7.9) in which all components of every 
column except the 7“ vanish constitute the irreducible sub- 
space p; and the space r of all matrices is the sum of these },. 
p: is irreducible; to show this we must prove that if a is any 
element in p; then any element of p; can be expressed in the 
form xa. a as well as a’ = xa has as its only non-vanishing 
column the 78; dropping the last index 1, we denote these two 
columns by 


(a, Xe °° 4%, Ar), (a, ap, sy r), 


respectively. The equation a’ = xa is then 
, r 
a; = D Eira | 
k=l 


we are therefore concerned with proving the theorem that any 
non-vanishing ‘‘vector’’ (aja, °° + o,) can be transformed into 
any given “vector” (ajx, «,) by an appropriate linear 
correspondence. Since not all the «, vanish take one of them, 
say a, which does not vanish and let all é; for which k = 2 
be 0; &, is then to be determined by the equation 


a; = Eits ; 
that this is possible is guaranteed by the division axiom. 
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(C) The assertion is self-evident for this step. 

In general only the first step, (A), does not lend itself to an 
exhaustive formal treatment. However, if the field over which 
the field (“ division algebra”) referred to in (A) is defined is 
algebraically closed this step becomes extremely simple : 

The only division algebra of finite order over an algebraically 
closed field is this field itself. 

Proof. Consider an algebra of order v defined over an 
algebraically closed field. If @ is an element of the algebra 
there must exist a linear dependence between the v + 1 powers 
a’, al,» ++ a l, i.e. a linear relation whose coefficients are 
numbers of the field. Hence æ satisfies an algebraic equation 
of degree m Sv: 


fla) — Am + y AmI + e.. + Ym 
fla) = qm yami + e.. Yml = Q. 


Since the field is algebraically closed f(A) can be expressed as 
the product of linear factors : 


fd) = (X— ay)(X— a4) + + A — an). 
Correspondingly 


(a — al)(a — al) > - + (a— aml) = 0. (7.15) 


We now introduce the assumption that the algebra of order v is 
a division algebra; then the product of two or more elements 
can vanish only if one of the factors is 0. Hence we may con- 
clude from (7.15) that a = «,1 for some 71; the algebra then 
consists of the products of the modulus 1 with any number of 
the fundamental field, and therefore the algebra itself is simply 
isomorphic with this field. 

If we are dealing in the field of all complex numbers the 
auxiliary theorem (7.3) can be replaced, in accordance with 
the above, by the more definite: 

(7.8’). All elements of the form ex'e are zero if the primitive 
idempotent elements e, e' are inequivalent. If they are equivalent 
all such elements are multiples of one of them (which is different 
rom 0). 

ú Further : The number of times an irreducible representation 
appears in the regular representation is not merely a factor of the 
dimensionality of the representation ; it is actually equal to it. 
Our analysis has thus revealed the true source of this remarkable 
fact. i 

Under these circumstances the given (“ semi-simple") algebra 
is the direct sum of simple matric algebras over the original field. 
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We obtain a complete set of basal units Eit ea” 
| a= Lente + Lae + (7.16) 
for the algebra; these basal units satisfy the multiplication 
law of ‘‘ matrix units,” i.e. products of the type 
| Cul = éit (7.17) 
and all others vanish. The correspondences 
a — |lagll, a> Jaxl °° 


are the inequivalent irreducible representations }’, 9”, -:°. 
The basal units ej, e”,: + + are the generating units e; e," 
of the irreducible sub-spaces p; with which we began our con- 
struction. eœ is the element of character (ik) generated by 
the correspondence IJ" of p; on px, i.e. that element which 
this correspondence associates with ej. 

After having obtained the irreducible representations in 
this constructive way we derive their orthogonality properties 
again from our present standpoint. For the moment let the 
trace of a denote the trace of the correspondence 


x —> y = ax (7.18) 


of t on itself which is associated with æ in the regular repre- 
sentation. In terms of the co-ordinate system defined by the 
basal units above this correspondence becomes 


gy’ 
Nik = Zui s. 
Each of the g’ columns of variables 


Eun, fox, re Eon (k = I, 2, ee g') 


undergoes the transformation with matrix |lo,||; the trace of 
a is accordingly 


g’ 
1 
g Atir +’ 


By (7.16) this is equivalent to the equations 


; O (i +k 
wa) [LEER 


for the basal units. Hence by (7.17) 
tr (ere) =g, (7.19) 
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and all other types of products of basal matric units have a 
vanishing trace. 

If the algebra is the algebra of a group of order h the corre- 
spondence (7.18) is expressed in the original co-ordinate system, 
consisting of the elements s associated with the elements s of 
the group, by the equation 


y(s) = Jia(st™) xt). 


From this it follows that the trace, as defined above, of a is 
equal to k» a(l); but in the case of a group algebra we have 
previously called a(l) itself, without the factor h, the trace of a. 
On returning to this original definition of the trace we need 
merely to replace the right-hand side g’ of the orthogonality 
relations (7.19) by g’/h. Equation (7.16) may now be solved 
explicitly for the coefficients : 


t r h , 
Ot, = mtr (aer) = Xals) Epil). (7.20) 


The connection with the development in Chapter III, § 13, is 
obtained by noting that the 


Uls) = ž eals?) (7.21) 


are the components of the matrix U’(s) associated with the 
element s of the group in the irreducible representation Ñ. 
The character of H’ is therefore 
d 
x'(s) = . e'(s=) (7.22) 
and (7.19) yields the orthogonality. relations for the representa- 
tions. 

We have thus arrived at a constructive formulation of the 
theory, in which the fundamental concepts involved in and the 
range of validity of each step are clearly apparent. It supplies 
us with a constructive method for obtaining a complete set of 
irreducible representations, as well as establishing the ortho- 
gonality relations. 


Additional remark. In dealing with the continuum of all complex 
numbers and a group algebra defined over this field we can, in accord- 
ance with the remark at the end of § 3, completely reduce the modulus 1 
into real primitive e; and the space r into the corresponding unitary- 
orthogonal irreducible p, Further, the projections T; can be normalized 


? 


in such a way that ¢,, is conjugate to ep. To show this we note that 
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the conjugate of ey is under these conditions an element of character 
(ki) and must therefore be the product of ep; by a number yp : 


Cin = Van” pee (7.23) 
The rules ., a a 
Efke T Cir Cu S Onik 
yield the conditions 
YikYki = Yip Ya = 1 


on the coefficients. Further, y,, is real and positive, for from (7.23) 
and (7.19) we find 


/ 


‘ Po oml 
Zlece(s)|* = treta) = E. Ya 


We then find that the y,, can be brought into the form y,, = 22/22, where 
the 8, are positive real numbers (take, for example, p: = y,,). On re- 
placing the original correspondences T, by £;T; we find that the new e'ri 
is actually conjugate to the new eip Our representations ý’, }', --- are 
accordingly thrown into unitary form, 


B. EXTENSION OF THE THEORY AND PHYSICAL APPLICATIONS 


§ 8. The Characters of the Symmetric Group and 
Equivalence Degeneracy in Quantum Mechanics 


The notation employed in this section is as follows: m = my 
is the symmetric permutation group of f things, rt = p = (r) 
the corresponding algebra, e a (primitive) idempotent element 
of p, p = te the (irreducible) invariant sub-space of r generated 
by e, } the representation induced in p by the regular repre- 
sentation, g the dimensionality of p and h, x the character of 
h, e that element of the set e’, £”, - - - (7.14) to which the irre- 
ducible p belongs; P the corresponding symmetry class of 
tensors of order f, consisting of all tensors of the form êF, © 
the representation’of the algebra 2 of symmetric transformations 
(and therefore of the linear group c) which is induced in $ by 2 
itself. When further differentiation is necessary, we also denote 
this by (x) or n(x). In case the considerations are valid 
for an arbitrary finite group 7, h denotes the order of m (= f! 
for Ty). 


Determination of the Group Characters. 


We begin by calculating the character of the representation h. 
To this end we construct the trace of the linear correspondence 


x —> y = ax (8.1) 
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of p on itself; the considerations of the previous section show 


it to be 
Zals)x(). 
Now consider instead of (8.1) the projection 
x —> y = axe (8.2) 


of the total space t on p; it coincides with (8.1) within p and 
sends any element x of t into an element y of p. On choosing 
the co-ordinate system in t in such a way that the first g funda- 
mental vectors span the sub-space p, the last h — g rows of the 
matrix of (8.2) consist only of zeros; hence the trace of the 
projection (8.2) of the total group space is equal to the trace of 
the correspondence (8.1) inp. In terms of components equation 
(8.2) is 

y(s) = Lia(t)x(s’)e(t’), (ts’t’ = s) 
and the trace is therefore 


Z Eafe) 


where the inner sum is extended over the pairs t, ?’ of elements 
of the group which satisfy the equation tst’ = s, or explicitly, 


the trace is 
zial) Zes )}. 
Hence the character x of } is given by 
x(t) = Le(s ts) 
or 


x(s) = Xelrs r). (8.3) 


r 


In particular, the dimensionality g of the representation Ñ (and 
the space p) is 
x(1) = h- e(l). 


Resonance or Equivalence Degeneracy. 


The significance of our results for quantum mechanics, as 
first recognized by Wigner, is the following.” The complete 
reduction of the tensor space R into invariant sub-spaces $; 
implies a separation of the terms of the physical system J/, 
consisting of f equivalent individuals J (electrons), into sets of 
terms which no dynamical influence whatever can cause to 
enter into combination with each other. We have further seen 
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eee Seat a R into the P; parallels the complete 
he total group space t of the symmetric permutation 
group 7 into invariant sub-spaces p, Hence there is a system of 
terms associated with every irreducible representation of a— 
which we denote simply as the term system x, using the 
character x of h as a name for the system—and the multiplicity 
of this term system is the number m(x) of times that ) occurs 
in the regular representation. This suffers a slight modification 
in case n < f, for we must then ignore all p; which are not con- 
tained in ra = IR. But since ry is both right- and left-invariant 
all sub-spaces which are equivalent to an irreducible invariant 
p lying in tọ are also in tọ» Hence the multiplicity of the term 
system x iS m(x) or 0 according as that e with which the character 
x is associated by (7.22) is in t, or not. From the physical 
standpoint, the only additional fact of interest obtained from 
the more extended theory built up on the assumption that the 
number field in which we are operating is algebraically closed 
is that then the multiplicity m(x) is equal to the dimensionality 
g of the representation }. Furthermore, it is impossible to 
resolve this multiplicity by any physical means whatever, for 
corresponding terms in these various term systems remain in 
coincidence under all dynamical influences. 

We consider the resolution of terms in the case in which the 
interaction between the f individuals is expressed by a small 
perturbation energy AW, neglecting higher powers of the small 
parameter A. Assume for the moment that the energy levels 
E, Ex, `> © of a single individual I are non-degenerate. On 
neglecting the perturbation If possesses energy terms of the type 


E = By+ Bytes Eri (8.4) 


we first concern ourselves with such a term. lts multiplicity 
is f! and the corresponding co-ordinates in tensor space are the 
coefficients F(i,, ia °° * iy) whose indices are any permutation 
sof 1,2,°°°,f. This coefficient F(iyi, + + + i) is the component 
x(s) of the element 


x == F(1, 2, °°, f) 
of the algebra (7). The separation of the term (8.4) is to a first 


approximation determined by the reduction of the correspon- 
dence 


Fig °° ty) = Zalha -ee ty; kikat hs) F (Rake +++ Ry) 


to diagonal form ; here the matrix of the coefficients a represents 
the energy and ty, ta, °° '; ig; ky Rey °° ky are permutatic 
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s, tof 1, 2,-++, f. This equation may therefore be written in 
the form 


t(s) = Sia(s, t) x(t). (8.5) 
The equation 
aliy +s ip; Ry st + Ry) = ality ss ty; Ry + ky) 
describing the symmetry of a, in which 
lol, +++ fof 
s any fixed permutation 7, is expressed by 
a(sr, tr) = a(s, t) 


for the only coefficients in which we are here interested ; r is here 
considered as applied to the indices 1, 2,+--, f themselves rather 
than the sub-indices. Hence a(s, é) depends only on st7!: 


a(s, i) = (st), 

and equation (8.5) may now be written in the abbreviated form 
(a): È= ax (8.6) 

where a, x, ž are the symmetry elements of the algebra (mr) with 
components a(s), x(s), £(s). 

On restricting ourselves to an invariant irreducible sub-space 
P of the system space Rf the element x of (7r) lies in the corre- 
sponding p. The g terms W,, Wa, +++, W, into which (8.4) is 
resolved by the perturbation and which belong to the term 
system x under consideration are, to the approximation involved 
in the perturbation theory, the characteristic numbers of the 


correspondence (8.6) of p on itself. The sum of these terms must 
therefore equal the trace of this correspondence, or 


Wi + Wit: -+ W, = Sals)x(s). (8.7) 


The sum of the squares of these terms, of their third powers, 
etc., are obtained by reiterating the correspondence (a), i.e. 


Wit Wi + +++ + Wi = La,(s) xls), (8.7°) 
where the a,(s) are the components of the symmetry element 
a’: 

a(s) = 1 or 0, according as s = l or + |, } 


t(s) = La,(st)a(t). (8.8) 


As soon as the “exchange energies” a(s) are known we can 
apply this formula to calculate those of the terms arising from 
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(8.4) which are contained in th l - 
only to know the a. = Sc ae ors ened 


we : —it is not necessary to have 
explicit expression for the idempo = ne 
representation h of v. potent generator e or the 


These considerations are immediatel 


ignore the spin phenomena. y applicable only if we 


If we take into account the 
ee er- 
turbation due to the interaction of the electrons before ee 


due to the spin, as in the case of nor 

l i, as In mal term order, the mere 
existence of spin implies that each of the energies F, is at least 
two-fold. We shall later concern ourselves with the far-reaching 
modifications caused by the spin and by the Pauli exclusion 


principle, which enables us to discard the maiori 
terms. jority of possible 


The unperturbed If will have, in addition to terms of the 
type (8.4), terms in which groups of two or more summands 
appear with the same indices. The multiplicity of the term 


Ey + fE te HAE, tfit: +f=f) (8.9) 
with integral non-negative weights f; is but 
fa 
m (8.10) 


The corresponding tensor coefficients x(s) are those obtained 
from 


F(LL- ++; 22ers) 
Sennen som? N sete enone” 
fi fz 


by the permutations s of the f arguments. But a permutation 
p is without effect if it only permutes the first f, indices among 
themselves, the next fa among. themselves, etc.; we may no 
longer distinguish between the permutations s and ps—they 
must be considered as giving rise to but one component. Such 
permutations p constitute a group m = n(fu fa - * +) of order 
h’ = filfal ©», and two permutations s, t are to be considered 
as the same if they are left-equivalent with respect to this sub- 
group a’, i.e. if s = t (ps = t, where p is an element of a’). The 
only elements x of the algebra (7) in which we are now interested 
are those which satisfy the equation 


x(t) = x(s) when ¢ = s (mod. 7’); 
they constitute a linear sub-space t’ = t(n’) of dimensionality 
(8.10). More precisely, t’ is a right-invariant sub-algebra, for 
if s = tthen also sr = tr. Again a(s, t) = a(st™); further 
a(ps) = a(s), (sp) = als) 


if p is in a’, 
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We are now concerned with the correspondence x > # in 1’: 
a(s) = Dia(st“1)x(t) (mod. 7’), (8.11) 
t 


é 2? 


where the “ mod. 7’ ” indicates that both s and ¢ run through 
a complete set of elements of the group which are inequivalent 
mod. 7’. As x runs through t’, xe generates a sub-space p’ of 1’ 
which is transformed into itself by the correspondence (8.11), 
and the reduction of this correspondence of p’ into diagonal 
form yields those terms arising from (8.9) and lying in the term 
system x. The trace of (8.11) in p’ is equal to the trace of the 
correspondence A,: x— in t’ which is obtained from (8.11) 
by replacing x by xe, i.e. x(t) by 


L(y ely) = Zx(rje(r™). 


Hence 
tr(A.) = J {a(st™) De(r-t)}. 
s,tmod. x r=s 
Since a(st~!) = a(rt“!) when r = s (mod. 7’), this trace may be 
written 
2 La(ri)e(r-%). 


Naturally this sum does not depend on which particular element 
t we have happened to choose from the set of group elements 
which are equivalent mod. 7’; hence on dropping the restriction 
on the range of t the above sum is multiplied by the order h’ 
of r: 


(A) = Zae) = pasl) (819) 


Here again x(s) is the character of ) as determined by (8.3). 
In particular, the dimensionality of p’, i.e. the number of terms 
in the system x arising from (8.9), is obtained by replacing the 
symmetry element a in (8.12) by the element a, defined by 


o(s) = 1 or 0, according as s = | (mod. 7’) or not: 
this number is consequently 


x(s) 


h’ ee 


(8.13) 


We express this result, the validity of which is not restricted 
to permutation groups, in the theorem : 

Let 1’ be a sub-group of n of order h' and let p be a left-invariant 
sub-space of the group space t of m. Consider the elements x of 
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the algebra (n) which satisfy the condition x(s) = x(t), where s and 
t are any two elements of the group w which are left-equivalent 
mod. a’; the elements of (m) which are of this type and which 
lie in p constitute a linear sub-space whose dimensionality is given 
by (8.13), where x is the character of the regular representation in p. 

The sum of the terms is equal to the trace (8.12), and the 
sums of their powers are given by 


Ea(s}x(s) 
ZW = FR (8.14) 


The only way this result differs from (8.7’) is by the introduction 

of the denominator /,!f,! +--+ and the fact that a,(s) is now defined 

by , 

Q,41(S) = Lia,(st“4)a(t) (mod. 7’). 
t 


Degenerate Case. Denote the numerically different energy 
levels of the individual J by E’, E”, +--+, and the multiplicity 
of E”) by n, We now distinguish between the various variables 
having the same “ principal quantum number ” v by an “ auxil- 
iary quantum number ” k, which assumes n, values. An energy 
level of the type 


E 4E” +e et EY (8.15) 
of the unperturbed total system Jf has the multiplicity 
. flnn ++ + Ny, 


and the corresponding tensor coefficients are those obtained 


from those of type 
L2- f 
F(p ky .. L) 


by any permutation s of the f pairs (v|k) of arguments ; we write 
instead 


x(s|kıka + + > Ry) or briefly x(s|k). 
Similarly the coefficients of the energy matrix are denoted by 
a(s|kyRe s.s Ry ; tilla =. la) = (sitk ; 1), 


The energy levels W arising from (8.15) by the perturbation 
and lying in the term system x are, to a first approximation, 
determined by 


ZW = Z Rash ; k)x(s), (8.16) 
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where a)(s|%; D = 1 or O according as s = 1, k = L or not, and 
the composition is defined by 


Arii(S|k ; D = Ya,(st|k; m)a(t\m; D. (8.17) 
t,(m) 
If the unperturbed energy level is of the form 
f'E + fE” + eee (f +f +. . 7 . = f) 


the tensor coefficients in which we are interested are those ob- 
tained from 


F(, Bi Be oo 
ku Rit t? ky Reet t ee 
e ee N een 
F f” 

Let exactly fı of the auxiliary quantum numbers kpv = 1, 
-+ + f^ have a certain value k,, fo a different value ką, etc. ; 
Atfet:::=f', and let fi, fa -< have the analogous 
meaning for the quantum numbers k,(v = 1, ---, f”) associated 
with the principal quantum number 2, etc. Then those per- 
mutations p which leave the above tensor coefficient unchanged 
constitute a certain sub-group w,, depending on the distribution 
of auxiliary quantum numbers k, of the group m’ introduced in 
the non-degenerate case above; the order of m, is [k] = filfa! 
- ++ fil- ++. a(s|k; T) is unchanged when s is multiplied on the 


left by an element of m, and on the right by an element of 77}. 
The formula (8.16) now becomes 3 


W = zizel: kixts) } (8.18) 


ay(s|k;7) = 1 or 0 according as k =l and s = | (mod. mj) 
or not, and in the composition rule (8.17) we first sum with 
respect to £ mod. Tp and then over the various possibilities 


m = (my, Myo, ttj May tty ttt). 


In every case we obtain explicit expressions for the sums of 
the various powers of the perturbed energy levels in terms of 
the character x of the term system under consideration and the 
exchange energies a(s). 


§ 9. Relation between the Characters of the Symmetric 
Permutation -and Affine Groups 


The thorough correspondence existing between the repre- 
sentations of the symmetric permutation group a, and the 


ne ee me 
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representations of order f of the linear group ¢ must lead to 
a simple relation between the corresponding characters. In 


dealing with the linear group it suffices to consider only the 
“ principal transformations ” 


Ny —> EX; (i = l, 2, -s n) (9.1) 


of the vector space R = R,, for any linear transformation is 
conjugate within ¢ to a principal transformation—except for 
those cases in which two or more of the characteristic numbers 
e; coincide. Furthermore, if we restrict ourselves ab initio to 
the unitary group u—the one in which we are interested in 
physics—the result is valid without exception and the e; are 
complex numbers of unit absolute value. The problem here 
proposed is identical with that of investigating the distribution 
of the terms of If among the various term systems y in the 
absence of interaction between the various individuals and when 
the single system J is non-degenerate, for on choosing a Heisen- 
berg co-ordinate system x; in the system space of J (i.e. one in 
which the operator representing the energy of J is in diagonal 


form) the variable x; assumes the multiplicative factor e( — =e) 


h 


in time ż. 

We denote the characteristic * of the representation § of 
the linear group whose substratum consists of all tensors of the 
form êF by X(S) or X(€, Ez, © © +, En) where the element S of ¢ is 
the principal transformation (9.1). The £, are to be considered 
as n independent variables. The transformation of tensor space 
associated with (9.1) consists in multiplying the coefficient 
Fli, tg, + + +, ty) of the tensor F by &;- 8° +t ey The sum of 
all these multipliers, extended over all linearly independent 
coefficients of a general tensor of the form F’ = êF, is the desired 
characteristic. A component in which fq of the arguments 72 are 
equal to 1, fẹ are equal to 2,- - + is multiplied by eft-ef- - - af, 
But the number of linearly independent components of F’ of 
this type is, by equation (8.13), 


Alfio (9.2) 


here y is the character of the representation Ñ of 7y, the sum 
being extended over all elements s of the group m = a(fi, fe, °°) 
which permutes the first fı numerals among themselves, the next 
fa among themselves, etc. That this number (9.2) depends only 


*We prefer, for the sake of clarity, hereafter to employ the word 
“ characteristic ” for continuous and “ character ” for finite groups. 
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on the character y is a fact of greatest importance for our present 
considerations. The result is 8 


Sigeli... 
Xen ea = E AA ea 


where the inner sum is extended over all the elements s of 
mfu Ja °° *). We denote the value of the character y for an 
element s belonging to the class f of conjugate elements of 
my by x(f) ; our formula may then be written 


... (k) 
X = { f) S Cnt - + (P) neh. . ‘, 94 
2 x) a falfa! . pee ( ) 
where cry, . . . (k) is the number of elements of w(ji, fa'e) 


belonging to the class £} This number can be evaluated in an 
elementary manner. 


Distribution of Permutations in Classes. 


Any permutation s is a product of cycles, no two of which 
contain a common numeral. The 5-term cycle (1 3 7 2 4) is 
a permutation which sends 1 into 3, 3 into 7, T into 2, 2 into 4, 
and 4 into 1 again; writing these 5 numerals at equidistant 
intervals on the rim of a wheel, this permutation may be con- 
sidered as the rotation of the wheel about the angle 27 5. Given 
any permutation, for example 


123456789 
Vy yey eed (9.5) 
347198265 


the cycles may be separated out by first determining the number 
(3) into which 1 is transformed, then the number (7) into which 
3 is transformed, etc., until a number is obtained which has 
already appeared in the cycle; this number can, of course, 
only be 1. After separating out the first cycle the remaining 
numbers can be handled in the same way, and the process may 
be continued until the desired result is obtained. The per- 
mutation (9.5) is, in terms of its 3 cycles, 


(13 7 2 4) (5 9) (6 8). (9.6) 


The reduction of an arbitrary permutation into its cycles is 
obviously unique. This way of writing the permutation enables 
us to tell at a glance whether two given permutations are con- 
jugate in my or not, for an element conjugate to (9.6) is obtained 
by replacing the numbers 1, 2, 3, 4,- ++ by the same numbers 
in any order. The class Ë to which an element s belongs is thus 
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determined entirely by the number of cycles and the number 
of integers they contain; in particular, any permutation s and 
its inverse s~! belong to the same class. We denote the class 
f whose elements s consist of 7, cycles with one numeral, 2, with 
two, ig with three, + + + by (i 2,73 - - *) and write x(f) = x(t, 22° *) ; 
naturally 


li, + ia + 3i,+--- =f (9.7) 


The number K of classes is the number of solutions of (9.7) with 
non-negative integers ty, 29, 23, °° *. 
The number of elements in the class Ý = (itat - © +) is 


— J! 

PO = REIER “8 
To show this we write the f integers 1, 2,---, f in any of the 
f! possible orders and divide off each of the first 2, integers by 
parentheses, then divide off the next 27, in groups of 2, the next 
37, in groups of 3,:++. The symbol so obtained is to be inter- 
preted as the expression of permutation in terms of its cycles. 
Each of the f! possible arrangements so obtained leads to a definite 
element s of the class f, and all such elements must be included. 
We must now investigate how often the same s occurs among these 
f!. Now the 5-term cycle (1 3 7 2 4) can also be read as (3 7 2 4 1), 
(72413), etc.: the particular integer with which we begin is 
immaterial; such a cycle will occur five times. Hence those 
li 2's 3¢ + + + arrangements which differ only by a cyclic per- 
mutation of the numerals in each cycle are all associated with 
the same clement s. Furthermore, the 1, 1-term cycles may be 
written down in any order, the t, 2-term ones in any order, etc., 
and these 1,!2,! + - > arrangements all lead to the same element s. 
Hence each element occurs exactly 1'17,!2’27,! + + + times, and the 
total number of elements in the class is accordingly given by 
(9.8). 

We must also determine the number of elements of f which 
are contained in the sub-group (jf, fo,°:°). For this purpose 
we divide the numbers from 1 to f in sections of lengths fi, 
fe, * + + and consider only those permutations s which permute 
the numbers of the first section among themselves, the numbers 
of the second among themselves, etc. On dividing s into cycles 
as in the above some of the cycles will be contained in the first 
section, i.e. will consist only of numerals belonging to the first 
section, some will be contained in the second section, etc., and 
no cycle will consist of numerals belonging to different sections. 
Denoting the number of l-term cycles contained in the first 
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section by in, the number of 2-term cycles in this section by 
iia etc., whence necessarily 


124, + ‘214. + 3145 + n|. = fr, 


the number of permutations of 1, 2, + +», fı satisfying this 
requirement is, by (9.8), 


_ ft 
lhal: © CD o e (9.9) 


Proceeding analogously for the 2nd 3rd, etc., sections, the number 


of permutations in m(fifa* * *) satisfying all our requirements is 
given by the product of all numbers of the form (9.9) for the 
various sections. But such an element is a member of the 
class Ý = (ii,  - °) if and only if 


Dita, = 1, Litas = ty, tt ti (9.10) 
æ a 


fin O= TT DS Dai =h 


where the sum is extended over the various solutions of equations 
(9.10) and 
DY ty = fa LV ig = Jas ° 
The inner sum in (9.4) is accordingly 
l Elo Exai as 
liZ’ . Si [I lar! teal \ 
(i) & 
the only restriction on the sum being the conditions (9.10). Let 


o, = E, F Eg F ° . "+ Ep, 
o, = Ey + Geto + En 


hence 


Our results can be expressed entirely in terms of these sums of 
powers, for by the multinomial theorem 


S(t} ge 
4 wlan 1, 
Sis b= nel. 
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where the variables 2,4,2,2, ° + °, over which the sum is extended, 
are subject to the restrictions (9.10). We thus finally obtain 
the simple formula 


_ Xliii °° jo? og: -| 
X (£, Eo, ) En) == > na . i! Ta! . l (9.11) 
f 


We have so far made use only of the elementary connection 
between the groups m and ¢. If we now introduce the assumption 
that the number field over which our algebras are defined is 
algebraically closed, and is in particular the continuum of all 
complex numbers, the primitive characters of the finite group a 
have the orthogonality properties 


SnlBxbxtE™) =h, 
Zn (E) =0 (x +x’). 


Furthermore, the number of primitive characters is equal to 
the number K of classes. The above relations assert that the 
matrix of the y(f), where y runs through the entire set of primitive 
characters and f all classes, has as its reciprocal the matrix 


Z n(x. 


Hence we also have 


Exx = 
Sxi» = 0 for f + Ë. 


This is, in fact, merely an alternative form of the completeness 
theorem. In dealing with the symmetric permutation group ry 
f-1 = f and the order ish = f!. 

On multiplying the expression (9.11) for the primitive 
character X by y(tyt, * © +) and summing over all the primitive 
characters y of my, we obtain, with the aid of the relations 
derived above, the important formula 


il te 


Oyo, °° = 2 X(ty2e n )X(e,, Eaton En) (9.12) 
X 


where y and X are the characters of corresponding irreducible 
representations of my and C,. 
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§ 10. Direct Product. Sub-groups 


Programme. 

If two atoms or ions with fı, fa electrons, respectively, come 
together to form a molecule we may to a first approximation 
neglect the interaction between the two atoms so long as the 
distance between them is relatively large. In this approximation 
the two kinds of electrons are dynamically different, for the 
electrons of each atom are influenced only by the nucleus and 
the remaining electrons of the same atom. The symmetry is 
therefore described by the sub-group 7’ of the symmetric group 
m = ry of f = fi + fa things in which the first fı and the last f, 
things are permuted among themselves. A similar situation 
arises when three or more atoms come together to form a 
molecule. These considerations immediately suggest the follow- 
ing problems. 

I. The theory developed in §§ 2-4 is to be extended to the 
case in which the symmetric permutation group is replaced 
by any permutation group ~m’. Naturally the definition of a 
symmetric transformation in tensor space is to be adapted to 
the new situation: we require only that the coefficients 
Q(ty* ++ iy; ky * + + Ry) of (1.2) remain unchanged under an ~ 
arbitrary permutation belonging to the group n’ of the sub-indices 
1, 2,--°-,f Wesay that these transformations are symmetric 
with respect to m'; they constitute an algebra 2’ which is 
obviously more extensive than 2.—This question is immediately 
settled by the remark that all our previous deductions are valid 
for an arbitrary permutation group m’. Here m’ is considered as 
an independent group rather than as a sub-group of the sym- 
metric group. 

II. Let the set of integers from 1 to f be divided into two 
or more sub-sets. We consider, as an example, the case of 
two sub-sets : the “ red’’ numerals from 1 to f, and the “‘ green ” 
ones from 1 to fe; Jı + fa = f. Let consist of all permutations 
of the red among themselves and the green among themselves. 
Hence a permutation s’ = (s,, $2) of 7’ consists of a permutation 
Sı of the f, red numerals and a permutation s, of the green ones ; 
mw is the direct product mı X na of the symmetric group m of f, 
and ma of fa things. Or conversely, this direct product—the 
abstract definition of which has nothing to do with the group 
of permutations of f things—may be considered as a sub-group 
m of the symmetric group of f= fı + fa things on arranging 
the sets of numerals, on which permutations of m, m, act, one 
after the other to form a single set. But here we are interested 
in the following problem (which can be proposed for arbitrary 
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finite groups): to discuss the properties of a group 7, X 7r 
which is the direct product of two finite STOUPS Ti, To. ° 

ITI. In order to discuss the structure of molecules we must 
eventually take into account the interaction between the various 
atoms or ions contained in the molecule. This means that we 
must finally return from the sub-group 7’ to the full symmetric 
group 7, so we must examine the relations existing between the 


group 7 and its sub-group m’. Here again the problem is not 
restricted to permutation groups. 


Direct Product. 


Let 7, ma be two finite groups of orders f, fa respectively. 
The elements of the direct product m = m, X 7, are the pairs 
(Sı, S2) Consisting of an element sı of m, and an element S of 
Ta An element of the algebra of m is accordingly a function 
X(Sı, Sg), and it follows from this that the algebra of m is the 
product of the algebras (7) and (,) : 


(m) = (ay) X (779) 
in the sense of the x -multiplication of vector spaces introduced 
in Il, § 10. An element x: %,(s,) of (m) and an element x, : 


%9(S2) Of (ma) yield the element x = x, X x, of (7), whose com- 
ponents are given by 


%(S4, Sp) = %4(Sy) * Xolsa). 


Indeed, given any two algebras p,, pa, their direct product 
P = py X pe can be constructed and multiplication in p defined by 


(a, X aa) (bi X ba) = (aby X agba) 


whether they are group algebras or not. 

If Pa is a linear sub-space of 1, = pẹ (a = 1, 2), an element 
x: X(Sı, Se) of (m) is in p = p, X p, if and only if it belongs to 
Pı when considered as a function of s,, holding s, fixed, and to 
Pa when s, is held fixed; indeed, any element of this kind can 
be expressed as a linear combination of products of the form 
A X ag, where a, is in p, and a, in py If paela = 1, 2) is an 
tnvariant sub-space of tą, generated by the idempotent element 
€x and the representation space of the representation Ñy of px 
induced in Pa by the regular representation, then p is also 
invariant, has as generating idempotent element e=e, X ez 
and is the substratum of the representation Ñ}, X Ña of p. Itis 
evident that the equivalences p, ~ pi, Pe ~ P2 imply the equi- 
valence Pı X Pa ~ pı X fo. 

Suppose the two p, considered above are also irreducible 


334 THE SYMMETRIC PERMUTATION GROUP 


with respect to their algebras p,; the question then arises as 
to whether p, X P is irreducible (with respect to p) and whether 
P = Pı X Pa is equivalent to p’ = p; X pe (pa irreducible) only 
if Pi~ Pi, Pa~ ps. p and p’ are inequivalent if exe’ = 0 
identically in x, i.e, if the sub-space consisting of elements of 
character (e, e’) contains only the element 0; here e = e, x ĉo, 
e' = 6, X e). Now the formula 


(e1 X @g)(%1 X Xa)(€1 X e2) = eX ye, X Egal 


shows immediately that the sub-space (e, e’) is the direct product 
of the two sub-spaces (e,, ¢1) and (es, e2), and can consist merely 
of 0 only if one of these two sub-spaces consists merely of 0, 
ie. only if p, is inequivalent to p; or p, is inequivalent to pz. 
Our second question is thus answered in the affirmative—regard- 
less of the nature of the field over which the algebras are defined. 

The first question is answered in the affirmative in III, § 9, 
for the only case of physical interest, i.e. that in which the field 
is algebraically closed. If we are more interested in the re- 
duction of the algebra than in the representations we can argue 
as follows. The algebra of elements of character (e, e) is the 
direct product of the field (division algebra) ®, of elements of 
character (e,, ¢,) in p, and the field ®, of character (es, e2) in pg. 
Assuming the original field is algebraically closed, all elements 
of ®, are multiples of ¢, and consequently all elements of P 
with character (e, e) are multiples of e. This proves the irre- 
ducibility of p, X pa. If, however, the original feld over which 
the algebras are defined is not algebraically closed our assertion 
is correct only if the direct product ®, X ®, of the two fields 
is again a field, and this is by no means always the case. But 
in any case the question concerning the nature of the direct 
product of algebras is, as in the question concerning the structure 
of an algebra in § 7, reduced to the analogous problem for fields 
(division algebras). 

Again taking the fundamental field to be the continuum of 
all complex numbers, the complete reduction 


n= EW, w= pP 


into irreducible invariant sub-spaces Pa has as a consequence, 
in accordance with the above, the reduction of t = tı X T, into 
invariant irreducible sub-spaces p® x p% 


Sub-groups. 


Let 7’ be a sub-group of the given finite group 7. An element 
x of the algebra t’ = p’ = (z') of x’ consists of components 4’ (s’) 
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associated with the various elements s’ of mw’. However, such 
an element can, and in the following will, at the same time be 
considered as an element of the algebra p = (7); we need only 
to define the components x’(s) associated with elements s of ~ 
which are not contained in 7’ as zero. This disturbs in no way 
the addition and multiplication of elements of (m) with each 
other or with arbitrary numbers of the field. An element x of 
(7) ‘‘belongs”’ to m’ or “lies ” in (a’) if and only if all com- 
ponents x(s) associated with elements s of the group that are 
not in w’ vanish. 

An irreducible invariant sub-space p’ of t’ is generated by a 
primitive idempotent element e’ and is the substratum of a 
representation }’ of m’ induced in p’ by the regular representation. 
On reducing the modulus 1 of 7’ into independent primitive 
idempotent elements 


; 
= Sat (10.1) 
i=] 


a certain number, say g’, of elements e; will appear which are 
equivalent to e’; the sub-spaces p; which they generate are all 
equivalent to p’ and the regular representation of 7’ contains }’ 
g’ times. Equivalent summands are added together into 
such partial sums. Considered as an element of the total 
algebra p = (m) e’ is, however, in general reducible into inde- 
pendent primitive idempotent elements : 


b 
e = Zea for (10.2) 

ot 
Here again equivalent summands on the right are collected 
together into partial sums; let the e, in the first such partial 
sum generate the representation } of m—we shall in the following 
be interested only in these. Let the sub-space p with the 
generating unit e be a representative of the sub-spaces p, gener- 
ated by the e,. The elements of (7) of the form xe’ constitute 
an invariant sub-space <p’) which is the substratum of a re- 
presentation <h’> of r induced in p’ by the regular representation 
of m. Our formula asserts that on reducing <h’> into its irre- 

ducible constituents } occurs exactly b times. 

In order to obtain a simple characterization of the elements 
of <p’> we divide the elements of the group 7 into sets of group 
elements which are equivalent mod. 7’; the w™ such class 
consists of the group elements o,s’, where s’ runs through the 
sub-group 7’. An element x of the algebra (m) has as components 
x(oys’); the numbers x(o,s’) may, for fixed u, be considered as 
the components of an element x), of the algebra (7’), so that x 
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may be considered as the set of elements x, belonging to the 
algebra (m’). The formula y = xe’ then becomes y, = xe’ in 
(7’): hence x belongs to <p’> if and only if all the partial 
elements x’, lie in p’. The correspondence 


x —> y = ax 
may then be written 


or 
f f 7 
Yy = LibyyXy 
v 


where a is the element of the algebra (7’) defined by 


Aul S) = a(o,s'o,'). 

The representation <h’> may therefore be constructed as follows : 
first associate with the element æ of (m) the matrix |[a’,||, the 
coefficients of which are elements of the algebra (m’) instead of 
numbers, and then replace each a by the matrix Aj, associated 
with it in the representation H’ of 7’. 

As we have seen in the earlier part of the present chapter, 
the representations are obtained with the aid of a double Peirce 
decomposition ; we therefore consider the elements x = e’xe’ of 
character (e’, e’). The idempotent elements ea, © + - appearing 
in (10.2) are of this character, and such an element x may be 
expressed in terms of its components 


b 
x= DI egkeg tee. (10.3) 
a,A=1 


We now repeat the analysis of § 7 for our more restricted set 
of elements: let I’, be a one-to-one similarity correspondence 
of Pa on p and let the element into which e, is sent by the corre- 
spondence I°,I's* be denoted by eap". If, as we now assume, 
the field over which the algebras are defined is algebraically 


closed e,%ég is necessarily a multiple x. of esp. We then obtain 
instead of (10.3) the reduction 


xX = a Xap Exp + so, (10.4) 
(where the x,, are numbers) and the representations 
%—> [leagh «> - (10.4’) 


* Here, as in § 7, but in contrast with our usual notation, the product of 
two or more correspondences T is to be read from left to right, 
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Now if in particular x is in (’) then x = e’xe’ is a numerical 
multiple of (10.2), and the matrix ||x,|| associated with such an 
element is a multiple of the unit matrix.—The degree of the 
secular equation, the solutions of which determine the character- 
istic numbers, is thus decreased from g to b for an element x 
of character (e’, e^). We now proceed to examine the cause of 
this. 

Let I;~* be a one-to-one similar correspondence of p’ on p; 
(i = 1, 2,---, g’), and let the element into which it sends e’ 
be b;. On considering an arbitrary element x of the algebra 
of r as the set x), we see that the correspondence 


xe’ —> xb; 


is a one-to-one reciprocal and similar mapping of <p’) on <p,’>: 
the projection I’; of p; on p gives rise to such a projection of 
<p,> on <p>. This projection associates with the reduction 
of <p> into irreducible invariant sub-spaces a reduction of the 
same kind of the sub-space <p;>; corresponding to equation 


(10.2) we obtain the equations 


b 
6; = Leg to -. (10:5) 

a=l 
On combining (10.1) and (10.5) we obtain a reduction of the 
modulus 1 intoindependent primitive idempotent elements of | 
(1). Now consider the partial sums X'e; of 1 and their reductions 


t 
(10.5) as written one above the other. Each row is then as- 
sociated with a definite representation H of z’ and each column 
on the right-hand side, the terms of which are sums of the form 
>» Xex, is associated with a definite representation h of m. We 
t & 


now collect together all the summands ey occurring in the first 
column on the right, i.e. all those elements ey which are equivalent 
toe. The set of indices J is then broken up into sub-sets, each 
of which is associated with one of the inequivalent irreducible 
representations b’, -+ of w’; the first of these sub-sets, which is 
associated with ’, consists of the bg’ double indices az. 

Let the similarity projection II! of p; on p; send e; 
into e., If x’ is an element of (7’) the equation 

x = Yex'e +e 
i,k 


yields the reduction 
x = Skrb tit! (10.6) 
i,k 
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with numerical coefficients x,,, and x’ > \|v,,[| is the representa- 
tion ’. (The partial sums should preferably be written one 
above the other rather than horizontally.) I; may be con- 
sidered as a similarity transformation of <p;> on <p’> and 
therefore contains a transformation of the same type of p,, on 
Pa; II, then provides us with a similarity correspondence 
of Pas on p. Let Iy be a fixed one-to-one similarity correspond- 
ence of py on p and let the similarity correspondence I'jI';’ of 
py on px send ey into ey, x. We may take the correspondence 
I;I, as Iy for the index J = at, and similarly for the remaining 
sub-sets. On applying the correspondence D; Dy =r (II,)7 
to equation (10.5) we find 


b 

ei. = È Cai; Ea tt (10.7) 

The equation 

x= Deseeg tes += Axror (10.8) 
JK J, K 


then determines the representations 


ho: «> lsrls 


By (10.6) and (10.7) the matrix associated with an element x 
of (7’) is 


J 
Mets pk = 8x typ, Xjg =Q 


where the two indices J and K belong to different sub-sets, 
But this means that on restricting m to 7’ the representation h 
is reducible into the irreducible representations ’, - «+ of 2’, 
bh’ appearing exactly b times. We have thus obtained a con- 
structive proof of the theorem °? : 

First Reciprocity Theorem (for arbitrary groups). If <H 
contains the representation } of n exactly b times, then on restrict- 
ing the group n to 7’, Ņ contains the representation b’ of n’ exactly 
b times. 

If the sub-group a’ consists merely of the unit element 1 
this theorem reduces to our previous result: the number of 
times an irreducible representation appears in the regular 
representation is equal to its dimensionality. Both the com- 
plete theorem and this special case depend on the assumption 
that the field over which the algebra is defined is algebraically 
closed. 

Connection with Symmetry Classes of Tensors. 


We apply the results of our investigation III to the symmetric 
group 7 and make use of the correlation described in I above for 
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m as well as for its sub-group 7’. An irreducible sub-space p 
of (m) determines a symmetry class $ = tp of tensors; let the 
corresponding representations of m and the linear group ¢ be 
ý) and §, respectively. An irreducible invariant sub-space p’ of 
(x) determines a symmetry class $’ of tensors which is invariant 
with respect to the more extensive algebra 2’ of all transforma- 
tions which are symmetric with respect to 7’; as such $’ is 
irreducible. If e’ is the generating unit of p’, $ consists of all 
tensors of the form @’F; but this is equivalent to saying that 
the symmetry element F of (r) belongs to <p>. Hence the 
reduction of $’ into irreducible invariant sub-spaces with respect 
to the more restricted algebra 2 parallels the reduction of <p’). 
Let h’ be that representation of 2’ induced in p’ by the regular 
representation of 7’ and H’ that representation of ¢ whose sub- 
stratum consists of all tensors in the symmetry class W’. Hence 
our general theorem—or rather its converse, the truth of which 
follows immediately from the theorem itself—allows us to state 
the 

Second Reciprocity Theorem (applicable only to permutation 
groups). If the irreducible representation } of m contains the 
irreducible representation Ņ' of n exactly b times when considered 
as a representation of the sub-group 7’, then conversely the repre- 
sentation )' of c contains the representation H exactly b times. 

Finally we take w’ as m, X mg as in step I] above. p° can 
then always be taken in the form P, X Pa and the irreducible 
invariant sub-space Pa of (m) determines a symmetry class Pa 
of tensors of order f, (a= 1, 2). Denote the corresponding 
representations of 7, and ¢ by Ñ, and Úa. The ‘8’ associated 
with p’ = p, X Pg consists of all tensors of order f= fi + fz 
which satisfy the symmetry conditions of $, with respect to 
their first fı indices and the symmetry conditions of P, with 
respect to the last fa; ie. P = P, x Pe Our theorem now 
becomes : 

Third Reciprocity Theorem (for permutation groups). If the 
irreducible representation ý of m contains, on restricting m to the 
sub-group n = m, X Tma the representation Hı X ha of a’ exactly 
b times (Ha an irreducible representation of m,), then conversely the 
representation Kı X He of c contains the representation H exactly b 
times. 


§ 11. Perturbation Theory for the Construction of 
Molecules 


We return to the investigation of the physical system J 
consisting of f electrons or equivalent individuals J. As long 
as we disregard the interaction between the individuals we obtain, 
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among others, f!-fold energy levels E of the type (8.4). We 
consider in particular the case in which the Æ; are different 
simple levels of the individual J. In order to follow the resolu- 
tion of E, due to the mutual interactions of the electrons, to 
the approximation which characterizes the perturbation theory, 
we must first determine the elements a of the algebra of r, the 
components a(s) of which are the exchange energies, and trans- 
form the matrices corresponding to a in the various irreducible 
representations of m into diagonal form by an appropriate 
change of co-ordinates (§ 8). We now assume that the most 
important of the exchange energies a(s) are those belonging to 
the permutations s of a certain sub-group 7’ of m; all others 
shall be small in comparison with them (“ quantities of 274 
order’). Our procedure is divided into two steps, corresponding 
to the investigation of sub-groups carried out in the preceding 
section, Let a’ denote that element of the algebra (’) which is 
defined by 


a'(s) = a(s) or 0 


according as s is an element of the sub-group 7’ or not, and let 
the matrices associated. with a’ in the irreducible representations 
Ņ' of 7’ be referred to principal axes; then 


eae, =0 (ik, gae = W,’ e. 


The characteristic numbers W; are the energy levels on neglecting 
perturbations of 2° order; we assume they are all different. 
In order to examine the further resolution of such a term 
W = W; under the influence of the 2™ order perturbation we 
need, in accordance with the perturbation theory, to consider 
only that part 

a* = e’ae’ 


of a which is of character (e’, e’), where we have written e’ in 
place of e; This term yields b terms W, belonging to the 
symmetry class x associated with the irreducible representation 
h of r, the values of which are the characteristic numbers of 
the matrix ||a,,|| associated with the element a* = e’ae’ as in 
(10.4’). All the algebraic elements appearing in these con- 
siderations are real and the corresponding matrices are con- 
sequently Hermitian. 

We apply the procedure to the process by which molecules 
are constructed from their constituent atoms.° We consider 
as an example two atoms joining to form a molecule, the one ' 
containing fı and the other f, electrons; f=f,+/, We 
consider the two nuclei as held fixed at a distance d apart, which 
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is large compared with the linear dimensions of the atoms, and 
attempt to determine their interaction energy as a function of d. 
The sub-group a’ = m, X m, consists of all permutations which 
send no.electron of one atom over into the other; we have seen 
in § 10 that we may then take the primitive idempotent elements 
e =e of the algebra (m’) in the form e, X e, where e,, ez are 
in (m1), (Ta) respectively. On neglecting the interaction between 
the electrons of the one and the electrons of the other atom we 
obtain an energy term W which belongs to definite symmetry 
states of both atoms. e’ generates a sub-space P = P, X Po 
(of the tensor space R) which is invariant under all symmetric 
transformations 7 that the state of the molecule is described 
by a tensor of this sub-space $ means that the state of the first 
atom is in $, and that of the second in %,. Hence on reducing 


P in parallel with the reduction of <p’> into irreducible in- 
variant sub-spaces : 


e = Deg too, p> = DOH P= RO, 


there occur b sub-spaces P which are equivalent to one another 
and which belong to a certain representation of a or to a certain 
symmetry class of terms of the total system. The procedure 
sketched in the preceding paragraph thus leads to b terms which 
(1) arise, due to the perturbation, from the given unperturbed 
term (8.4) and (2) which belong to certain given symmetry 
states xı, Xz and x of the two atoms and the molecule. This 
reduction of the total system space Rf into sub-spaces, each of 
which corresponds to a definite symmetry state of each of the 
atoms taken separately and of the molecule, naturally is not 
bound up with the approximate calculation of levels with the 
aid of perturbation theory; the connection between the two 
appears only on taking the above condition (1) into account— 
the very essence of which implies the assumption of small per- 
turbations. This somewhat sketchy account of the situation 
arising from an unperturbed term of the type (8.4), in which 
the energies E; of the individual J are non-degenerate, can readily 
be extended to cover other more complicated types of unper- 
turbed terms. These other cases are of course of much greater 
physical interest, for we have seen in Chapter IV that all atomic 
energy levels, except S-terms, are necessarily degenerate.™ 

The fact that the total system may be in any one of several 
symmetry states $, corresponding to different energy levels 
(i.e. binding energies), when the symmetry states of the com- 
ponent atoms are given is of greatest importance. We shall 
later show that these possibilities, finite in number, coincide with 
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those predicted by the empirical theory of the valence bond, and 
that consequently the symmetry state of an atom is that which 
chemists call its valence state. The situation thus arising cannot 
be described adequately in terms of classical models—e.g. the 
fact that the two H atoms constituting an H molecule can be- 
have in such a way that the state of the molecule may lie in 
either the space of symmetric or anti-symmetric tensors of 
order 2; only the first case can lead to an attraction which will 
bind the atoms together—the second always results in a re- 
pulsion. The binding energy between two ions of total residual 
charges é,, e, is naturally due mainly to the Coulomb potential 
€,é,/d (‘‘ ionic binding ” or ‘“‘ polar bond ”’), but the corresponding 
energy for two neutral atoms is due for the most part to the 
interaction of the ‘‘ exchange energies ’’ a(s) of the electrons of 
the two atoms (“atomic binding” or ‘‘ non-polar bond’’). 
This quantum-mechanical solution of the puzzle offered by the 
non-polar valence bond was first given by F. London and 
W. Hettler. 

The following points are to be taken into consideration in 
applying the theory of perturbations to the actual evaluations. 
On neglecting the interaction between the various electrons 
each is subject only to the attraction of the two nuclei; we 
should therefore perhaps begin with the characteristic numbers 
E; and the corresponding characteristic functions (xyz) of 
this one-electron problem. The first approximation should then 
be obtained by taking into account the repulsions between the 
electrons of each of the atoms separately, thus introducing a 
dynamical difference between the two kinds of electrons. This 
procedure is naturally significant only so long as the distance d 
between the atoms is large in comparison with their linear 
dimensions a. But then it is also reasonable to take as our 
0 approximation that in which each of the electrons is subject 
only to the attraction of its own nucleus (plus the closed shell 
of electrons which are not to be taken into explicit account in 
the calculations). Let this one-electron problem for the first 
atom have the characteristic values E, and characteristic func- 
tions %; and let the corresponding quantities for the second 
atom be Ey, py. The fact that the %, and the p;p together 
cannot constitute an orthogonal system—indeed, they are not 
even linearly independent, for the #; alone constitute a complete 
orthogonal system—causes some difficulty. But if we break off 
the series of quantum states at a finite »—which can be chosen 
higher the larger the value of d/a under consideration—the 


finite set 
ip: ws, Wo, mo ey wns aby, the, se Dy 
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of functions % constitute an almost orthogonal system; the 


fundamental metric form Gp, the coefficients of which are the 
scalar products 


Sin = (Pa Pi) = joi py dV 


(where 7 and k run through the primed as well as the un-primed 
indices), differs but little from the unit form, Indeed, an integral 
of the form (i, wy) is of order of magnitude e-ta To show 
this we note that if the two centres of force are nuclei or closed 
cores with ‘‘ unit’’ residual charge, the normal states of the 
atoms are given by 


. grla l 


1 
hS Jra h Be Vra 


. pr'a 
ela 


where v and v’ are the distances to the two cores. The integrand 
in 


(by, by) = z Zafe (r+ rya qy 


is everywhere & 7%. This integral can readily be exactly 
evaluated on introducing bi-polar co-ordinates (r, r’, $); the 
volume element is then 


dV == try ‘ dr adr’ 


and the range of integration is defined by 
ytr 2d, —dsr—r sad. 


On introducing 


r+ r —r , d 
d = P, d =P, a =À 
we obtain 
+1 
A3 , , 
nh) =F] [e apap 
-1 


l 
(e — 5)e eMdp = e a(i tA+ y y, 
For the f-electron problem we therefore start with the 


functions 
Ming + +5 i) = Haleya) 
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as approximations to the characteristic functions; in this 
product the co-ordinates are those of the f electrons and t runs 
through the values 1,, 72, © * *, 77, each of which is one of the primed 
or un-primed indices between 1’ and n’ or l and n. The funda- 
mental metric form G = Ga X Ga X °'* X Go has as components 
the scalar products of #(i,, i, © © + ty) with (ky, Rg ' °° ky) and 
the components of the energy H, the potential part of which is 
obtained by adding together the potential energics resulting 
from the attractions and repulsions of the various electrons and 
the two cores, are the scalar products of #(i, » - * iy) with the 
vector Heb(k, ++ + ky) into which p(k, - + - Ry) is sent by the 
operator H. We consider the resolution of the unperturbed 
term 


E = (E+: +++ En) + (Ev +: +++ By). 
The components 
Gli, ++ iy; Ryo ++ Ry) and H(ty+ ++ ty; kateek), (ILI) 


in which the indices 7, k are permutations s, t, respectively, of 
l, +*+, Ju I, °° +, fa, are of the form G(si~) and H(st™). We 
introduce the (real) elements @ and H with components G(s) 
and H(s). G and H are next replaced by G’ and A!’ with com- 
ponents G(s) and H(s) if s is in m’ = m, X ma and 0 otherwise ; 
the justification for this lies in the fact that the components 
associated with an s which is not in 7’ are very small—they are 
of relative order e~*4/*, @’ is in fact the modulus, whereas G 
is not; the procedure employed previously must therefore be 
modified in the following purely formal respect. On repeating 
the reasoning, keeping in mind the fact that Œ is no longer the 
modulus, we find as the secular equation for the determination 
of the b terms À = W, 


| AGug — Hag | = 0, (11.2) 
in which 


e’Ge’ = 2 Gap Cup + ar 
ap 

e'He' == Z Hap Cup + es 
“p 


in terms of the notation employed in the preceding section. 

This procedure is open to the criticism that whereas the 
second order perturbations between the electrons of the same 
atom are neglected, the interaction between the two atoms, which 
is considered to be of second order, is taken into account. The 
results are therefore inapplicable to the limit d/a -> œ and can 
at most be applied successfully in cases in which dja is consider- 


“te, a - 
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ably larger than 1 but not too large. On the other hand, we 
could begin by assuming that the solution of the quantum 
problem for the individual atoms is already known. Let the 
function #, of the co-ordinates of the first f, electrons be a 
characteristic function of the first atom corresponding to the 
energy term E, (so normalized that the integral of d,s, is unity) ; 
it will belong to a certain simple symmetry state of the first 
atom, i.e. there exists a certain real primitive idempotent element 
e; of (7,) such that ê pı = p Similarly, let 4, be a character- 
istic function of the second atom for the term E, having a 
corresponding property êp, = p, Neglecting the interaction 
between the atoms, ẹ = ya . ýa is a characteristic function of 
the molecule consisting of the two atoms and having the energy 
E =E +E, e =e, X &, is a primitive idempotent element 
of the algebra of 7’ = a, X m, and ẹ has the property 
e's = y. 

The functions sy, which are obtained from % by the totality of 
f! permutations $ of its arguments, span a linear function space 
(R) of a finite number of dimensions—in which the sy are natur- 
ally neither linearly independent nor mutually orthogonal. 
The theory of perturbations requires us to find those functions 
¢ of (R) which are such that the orthogonal projection of Hd 
on (R) is proportional to ¢ itself; the factors of proportionality 
are then the values of the displaced terms, to a first approxima- 
tion. We must therefore evaluate the integrals G(s, t), H(s, £) of 


th-sb and th- H(si) 
and solve the secular equation 
JAG(s, t) — H(s, t)| = 0. 
G and H depend only on tts :* 
G(s, t) = G(t4s), H(s, t) = H(t74s). 


This is proved by the fact that the integral of +d is unchanged 
on replacing Y, $ by rd, rọ (r an arbitrary permutation) ; H(sẹ) 
is equal to sHy because of the symmetry of the operator H. 
Let Œ and H again be the elements of (7) with components 
G(s), H(s). They satisfy the equations 

ce'Ge = G, eHe =H 

* On comparing this with (11.1) it is to be remembered that there the 

permutations s and ¢ operate on the indices and not on the arguments ; hence 
the elements (11.1) are, in our present notation, 

G(t-1, s-}) and H(t-1, s7), 
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and are therefore of character (e’, e’). Indeed, we have, for 
example, 


p= Eet) «mp, whence’ H(s) = De) - H (sry), 
and on multiplying this latter by ẹ and integrating we find 
H(s) = De'(r)A(sr) or H = He’. 


It then follows that also H = @’H whence, since e’ is real, 
H = e'H and consequently H = e’He’ as asserted. 

The only non-vanishing elements of the matrix ||Hzxll, 
which corresponds to the element H in the representation h, 
are (in the notation of § 10 with e; = e’) those contained in the 
square sub-matrix of length b in which the row and column 
indices J and K are of the form al. We are thus led directly 
to the secular equation 


| AGag — Hag | == 0 


of bth degree. (The most natural method of solving this equation 
consists in finding that linear transformation which sends the 
Hermitian form with coefficients Gag into the unit form and at 
the same time reduces ||H,,|| to diagonal form.) 2H. is then 


a 
the trace of the matrix belonging to Ħ in the representation b, 
or 


If in particular b = 1 the above symmetry system of the 
molecule contains but a single term arising from the unperturbed 
term E; its value is, in accordance with the equation derived 
above, given by 


LH (s)x(s) _ E + 2'H(s)x(s) 
Sel TESCO xs) (11.3) 


The accent on the right-hand side indicates that these sums are 
to be extended over only those permutations s which do not 
belong to 7’. This formula (11.3) is due to F. London.’ It 
will be shown later that in the case of diatomic molecules b 
is always 1; we must expect, however, to find higher values of 
b in dealing with more complex molecules. The real difficulty 
from the physical standpoint naturally consists in getting in- 
formation concerning the exchange energies H(s). It is to be 
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noted, however, that we need only to concern ourselves with the 
sums 
DH (rsr), Z G(rsr7}) 
r r 

over the various classes, for since x(s) is a class function all 
summands in (Ll. 3) for elements in the same class f may be 
added together to give the above coefficients multiplied by x). 

Without doubt these investigations, which are as yet in their 
infancy, are of fundamental importance for theoretical chemistry ; 
the non-polar bond is due to the exchange energies. Heisenberg 
has given an explanation of ferro-magnetism with the aid of 
these same principles. 14 


§ 12. The Symmetry Problem of Quantum Theory 


On taking the spin into account the components of a vector 
x(vt), which represents the state of a single electron, has two 
indices ¿ andi; the first of these refers to the spin and runs from 
l to v, while the second refers to the translation and runs from 
lton. Actually v = 2 and n = oo (as long as we do not restrict 
ourselves to the consideration of quantum states with fixed 
energy). Our vector space R is accordingly R,n = R, X Rn. 
The state of a system consisting of f electrons is now to be 
represented by a tensor of order f in this space: F(t, tols, 

- *, tyty)—a “double tensor’ which stands, so to speak, with 
one foot (the Greek indices) in the space R, and the other (the 
Latin indices) in Rp. This tensor space is completely reducible, 
with respect to the algebra 2,, of all symmetric transformations 
of the index pairs (uw), into irreducible invariant sub-spaces, 
each of which is generated by ~.n idempotent symmetry operator. 
The Pauli exclusion principle states that only one of these sub- 
spaces §8,, is physically realized ; it automatically abolishes the 
physically absurd existence of multiplicities which cannot be 
resolved and at the same time denies the existence of absolutely 
non-combining systems of terms. Furthermore, according to 
Pauli this $8, is the space {R/,,} of all anti- -symmetric double 
tensors. 

On ignoring the spin perturbation, $,» is to be reduced as far 
as possible into sub-spaces P which are invariant with respect 
to the special symmetric transformations of the form 


F (ein i) = Feli s+ o ap; katik): Plaka + eves) (12) 


which do not depend on the Greek indices at all; these constitute 
our old algebra 2 = Sa This transition from 2,, to 2, is to 
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be accomplished in two steps. We first ignore the interaction 
between spin and translation, but allow the translations to 
interact among themselves in an arbitrary manner and similarly 
the spins among themselves ; we must then consider only the 
symmetric transformations of the form 


Ye tt bey Ky tt Ky) elt tt tp; Ry + + ke). (12.2) 


These transformations do not constitute an algebra themselves, 
but they belong to their ‘‘ enveloping ” algebra 2, x X, which 
consists of all transformations whose coefficients 


Chti © + + tig) Kiky + + e KyRy) 


are unaltered on subjecting the two rows i * ** ty; Ky °° + Ky 
of Greek indices to the same arbitrary permutation o and the 
two rows of Latin indices to the same arbitrary permutation s. 
The second step then consists in letting y in (12.2) be the identity. 
The first step thus consists merely in making the permutation 
of the Greek indices independent of the permutation of the Latin 
indices, and the second in restricting the first of these permuta- 
tions to the identity. 

In the first place, then, we introduce the elementary sym- 
metry operator o X s which, on applying it to the double tensor 
F(uiy * -> tty), subjects the Greek indices to the permutation 
o and the Latin to the permutation s. The general symmetry 
operator is then an arbitrary linear combination 


a = Jalo, s)(o X S$) 


of these elementary ones ; we have thus to deal with the algebra 
p X p of elements x, the components x(o, s) of which are functions 
both of whose arguments run through the elements of the group r. 
We denote the element with components F(o, s) = (o x s)F 
by F; the equation fF’ = af (F’ the double tensor obtained 
from F by the operator a) is equivalent to F’ = F.a. The 
group 7 X n of elements o X s contains ~ itself as the sub-group 
consisting of elements s X s. So far as the first step is con- 
cerned, our problem amounts to the following: Let (s) be the 


components of a primitive idempotent element of the algebra 
t = p= (m); we set 


I= Sil(s)\(s x s) 


and study the elements of the form xl in p X p. They con- 
stitute an invariant sub-space (r x r), which is to be reduced 
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into its irreducible invariant constituents; in Pauli’s case we 
have in particular 


l= FE A(s x 5) 


The procedure which it seems natural to follow is first of 
all to express the modulus 1 of p in any two ways as the sum of 
primitive independent idempotent elements : 


l= Je, l= Se, (12.3) 
t j 


An arbitrary element x of the algebra of p X p is reduced into 
independent constituents in accordance with the equation 
x = Lx(e; x é;) = Z Kij (12.4) 
i, j ij 
Now we know from § 10, II, that the elements of the form x; 
constitute an irreducible invariant sub-space p,;; consider 


1,9 


in this light. The projection x y = xl sends p,; over into 
a certain invariant sub-space (p,;) of (rt X t). Since those 
x of Pı; for which xl = 0 constitute an invariant sub-space of 
Pı; we have only the two typical possibilities: either (p;;) = 0 
or this projection x — xl maps p,; in a one-to-one and similar 
manner on (P;;). The sum 


(CX th = £ (Pu), (12.5) 


arranged in some particular order, is such that each term can, 
in virtue of its irreducibility, only either be contained in the 
sum of the preceding terms or be independent of this sum. On 
retaining only those terms arising from this second possibility, 
(t X t), is completely reduced into the sum of certain of the 
(Pa); the representation induced in (r X rt), by the regular 
representation of the group m X m is correspondingly reduced 
into its irreducible constituents of the form }’ x Ñ. It will be 
remembered that this symbol stands for the correspondence 


(e, s) > U'(e) x Uls), (12.6) 


where fy’, ) are the irreducible representations o —> U'(øo), 
s—> U(s) of a. This representation }’ X h appears with a 
certain multiplicity b(y’, x) which is determined by the number 
of pairs ij in (12.5) whose e,’ generate the representation ý’ 
and whose e; generate }. These considerations are of course 
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merely a repetition for the case at hand of the proof of theorem 
(6.1). 

We now return to the space of double tensors and consider 
the sub-space & defined by those of the form IF. It is the 
substratum of a certain representation U(2, X 2,) of 2, X En, 
and its complete reduction is given by the formula 


MS, x Ln) = Flr’, XG, X Gn). (12.7) 


This remains correct even if » or n is less than f. Earlier in 
this chapter we introduced the right- and left-invariant sub- 
space ty of t as that sub-space consisting of all elements F which 
correspond to tensors F in the n-dimensional vector space Rp. 
On denoting this tẹ which depends on n (and only for n = f 


n 
coincides with the entire r), by t we should consider the algebra 


y n v n 
t X tinstead of r x r. But if e is in t and e; in t, the manifold 


v n 
of elements x(e; X e;) is not decreased on restricting x to t X T, 


and every e, (e) which is equivalent to such an e; (e,) also 
E n 
belongs to r (t). This shows that (12.7) remains correct under 
n 


this restriction to r X t; the only effect is that those terms for 
which $, X , is the 0-dimensional representation are illusory 
We are now ready to take the second step: to perform the 
transition from the algebra 2, X 2, to 2 = X, by taking y in 
(12.2) as the identity. We then see immediately that the 
representation Q(X) of 2, whose substratum consists of the 
double tensors of Q in the sense of equation (12.1), is completely 
reduced into its irreducible constituents ©, corresponding to 
the various primitive characters x of 7, in accordance with the 


equation 
&(Z) = Lim(x) : 9. 


The multiplicity m(x) with which this representation occurs 
is given by ; 


m(x) = 4 d(x’, Nx’), (12.8) 


where N,(x) is the dimensionality of the representation 9,, 
and the sum is extended over all the primitive characters x’ 
of m. Hence on disregarding the spin perturbation we obtain 
the same type of reduction into non-combining systems of 
terms as before, except that the multiplicitv, which was previ- 
ously equal to the dimensionality g of x, is now given by (12.8). 
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(The spin perturbation causes weak inter-system combinations 
to take place and, in addition, resolves each term of the system 
x into its m(x) components. m(x) is the multiplicity of the 
multiplet structure. Term systems x for which m(x) = 0 do 
not appear at all.) 

Our reciprocity theorem enables us to determine the con- 
stants b. As mentioned before, a is contained in m X m as the 
sub-group of elements of the form s X s; the algebra p = (7) 
appears in p X p as the totality of algebraic elements of the form 
da(s)(s X s). The elements xl of the algebra p constitute an 


8 

irreducible invariant sub-space p,; let the irreducible repre- 
sentation of m which is induced in this sub-space by the regular 
representation be denoted by Ñ; and its character by A(s). The 
space of all elements of the form xl in p X p is then ¢p,> in the 
notation of §10; it is the substratum of the representation 
<p> of p X p. <> contains the representation }’ x § exactly 
b times; the reciprocity theorem then tells us that the number 
of times the representation h’ X Ñ contains the representation 
Ý on restricting m X a to its sub-group ~m is also b. Now this 
restriction to m sends the representation (12.6) of m x a into 
the representation 


(s, s) > U'(s) xX U(s) 


of m. This means, however, that b(y’, x) ts the number of times 
the representation }, of m is contained in the representation h’ X } 
of m (no longer with boldface multiplication sign !). Hence 
b is expressed by 
B(x’, x) = MXx"(s)x(s)A(s)}- (12.9) 

With this we have carried our solution of the problem of deter- 
mining the multiplicities m(x) as far as is possible in the general 
case, 

Consider in particular the special cases (1) complete symmetry, 
Q = [R], and (2) complete anti-symmetry, & = {Ji/}—the 
Pauli case. For the first A(s) = 1. With each irreducible 
representation y is associated the contragredient representation 
with character X(s) = y(s“); if the substratum of the first 
is generated by the idempotent element e the substratum of 
the latter is generated by é. Or we may describe this situation 
by saying that x and ¥ are the characters of mutually contra- 
eredient representations. (Accidentally y(s~1) = x(s) for the 
complete symmetric group 7; this does not hold for a general 
permutation group, however, whereas our entire theory does.) 
Equation (12.9) now becomes 


B(x’, x) = MEx'(s)x(s7)}- 
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But in virtue of the orthogonality property of characters this 
mean value is 1 or 0 according as the representation ¥ is equiv- 
alent to x’ or not. The expression (12.8) for the multiplicity 
then assumes the simple form 


m(x) = N,(X). 


The theorem that the representation ) x contains the identical 
representation s —> 1 once or not at all according as f’ is equiv- 
alent to the contragredient of or not is nothing other than 
the fundamental theorem [III, (10.5)] on which the entire 
theory of representations was based. 

In the second (anti-symmetric) case A(s) = 8. Now 


x*(s) = ð, * x(s73) 


is the character of the “ dual” representation )* associated ` 
with ); if Ñ is generated by the idempotent element e then }* 
is generated by the idempotent e*(s) = 8,-e(s-4). Or if 
h:s—> U(s) then h*:s5s-»8,- U(s). The expression for the 
multiplicity is in this case 


m(x) = N,(x*) (12.10) 


If we denote the 1-dimensional representation s —> 8, by {1}, 
the fundamental theorem mentioned above tells us immediately 
that §’ x } contains the representation {1} once or not at all 
according as }’ is equivalent to }* or not. (12.10) is the actual 
multiplet formula, for this second case is the one which is of 
interest for atomic physics, 


Additional Remarks. 


The only cases of importance for physics, (1) that of sym- 
metric and (2) that of anti-symmetric double tensors, can be 
handled by elementary methods. We again refrain as long as 
possible from making restrictive assumptions concerning the 
field over which the algebras are defined. The method will be 
illustrated by application to case (1). 

(12.11) If e, e, are equivalent idempotent elements, then 
é,, ĉa are also. 

Proof. Let p, be mapped on Pe by a one-to-one similarity 
correspondence I": x= xb; b is here the element, of char- 
acter (e, €a), into which e; is sent by I. Let the inverse corres- 
pondence carry €, over into a, which is then of character (e,, e,). 
I" carries a over into ¢,; since the element associated with a by 
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I is ab we have eg = ab. Similarly, we find with the aid of I~} 
that e, = ba. We then have | 


e, = ab, e= ba; ee; =a, ebe, = b. 
Conversely, the existence of these equations guarantees that 
Xo — Xb, XY — Xd 


are reciprocal similarity correspondences P, 2p. That is, the 
existence of these four equations means that e, and e, are 
equivalent. We need only to ‘‘ roof ” these equations in order 
to conclude that ê and ĉ, are then also equivalent—i.e., go 
over to the quantities % associated with each of these x by the 
definition (s) = x(s“4). We have here neither assumed that 
the e are primitive nor that the field is algebraically closed. 


(12.12). The invariant sub-spaces P, p generated by e, é are 
the substrata of mutually contragredtent representations. 

Proof. Let p consist of all elements xe; we introduce in 
addition to this left-invariant sub-space the right-invariant 
sub-space q consisting of all elements of the form ex. Let 
tr (xy) be the trace of the elements x and y, which may vary 
freely in p, q, respectively ; we assert that it is a non-degenerate 
bilinear form. That is: if tr (ay) = 0 identically in q then the 
element a of p must be 0, and if tr (xb) = 0 identically in p the 
element b of q must be 0. Indeed, if z is any arbitrary element 
whatever and a is in p, then 


az = ae * 3 = Q * ez = ay, 


where y = ez isin q. Hence the assumption that tr(ay) = 0 in q 
implies that tr(az) = 0 for arbitrary z, whence a = 0 [cf. § 4]. 
Similarly for the remaining case tr(xb) = 0. 

Now let p and q be referred to arbitrary co-ordinate systems 
and let the co-ordinates of x, y be Én €,° °°, És; Nu Na Mha 
respectively. Then tr(xy) is of the form 


tr (xy) =" Eine 


The theorem above shows that g Sh and h S g, whence h = g, 
and that the coefficients S; may be considered as the coefficients 
of a non-singular linear transformation. Hence on choosing 
the co-ordinate system in q in an appropriate manner tr (xy) 
may be reduced to the canonical form 


g 
tr (xy) = 2 Eii 
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tr (xy) = tr (yx) = tr (yr) + rx). 
Hence the simultaneous substitution 

x = 1x, yy’ = yr, 

which does not lead out of p, q respectively, leaves the trace 
invariant. These two transformations are therefore contra- 
gredient in the new co-ordinate systems; our assertion (12,12) 


then follows immediately on writing the second of these equations 
in the “ roofed’ form 9’ = ry and noting that ŷ runs through 


the left-invariant sub-space 6 generated by ĉas y runs through q. 
After this preliminary skirmish we apply the method em- 
ployed before, somewhat modified, to the case (1) in which 


l 
l = mA (S X 5). 
Fee XS) 
We are now interested in the reduction (12.4) only for symmetric 
elements x, i.e. elements which satisfy the equations 


xlor, sr) = x(a, s$) (12.13) 
for ally. This amounts to replacing x by xl; we subsequently 
note that xl(e’ X e) is not symmetric and accordingly multiply 
again on the right by /. We thus replace e’ x e by Z(e’ x e)l 
rather than (e’ X e)l and proceed to obtain an explicit expression 
for the reduction, rather than calling on the aid of the reciprocity 


theorem. First, the components of le’ X e) are (on ignoring 
the factor 1/f!) given by 


De (va\e(rs) = Jlr he (ro) = ée'(s~ 40). 


This expression vanishes if ée’ = 0; for e’ = ê we find it is 
equal to é(s-4c) = e(o™ts). This suggests that we choose 


l = Zéi, l = PNF 
{ 


as the two complete reductions (12.3) of the modulus 1. The 
only terms in the sum (12.4) which then remain for symmetric 

== xl are those of the form x(é, X e,;), and the factor (é; X e,) 
is the element with components e,(o7's). Since x(é X e;) has 
not been reduced identically to.0 on restricting x to the domain 
of symmetric elements, the sub-space which it generates is 


here, as before, equivalent to the irreducible p; x Pı The 
next step consists in multiplying on the right with }, whereby 
e(o~4s) becomes, in accordance with (8.3) and (7.22), 


A Ze(rto-sr) = axis = > -e(o7s). 
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Our final result is that an 
; y symmetric x ; 
cordance with y can be reduced in ac- 


2- ye! ” oa l 
x == xe’ + xe” + , where e(e, s) = 7 elos) 5 (12.14) 


+ 


in deriving this result it is to be remembered that the number 
of times any irreducible representation appears in the regula 
one is given by its dimensionality. suar 
It follows from the fact that e(s) is a class function that these 
elements E’, e” constitute a set of independent idempotent 
elements in p X p. This result is in fact obtainable by direct 
methods and is valid, regardless of whether the field in which 
we are operating is algebraically closed or not. To show this 
we note that any “ symmetric ” element x(o, s) is a function 
only of so™* in virtue of (12.13): x(o, s) = s¢(so72). Thus there 
exists a one-to-one correspondence between the symmetric 
elements of p X p—the space of which we denote by [t XxX t]— 
and the elements of t. Direct computation shows that this 
correspondence associates with each left-invariant sub-space of 
[r x r] a left- and right-invariant sub-space of r, and conversely ; 
the reduction of [t x r] into left-invariant sub-spaces thus 
parallels the reduction of t into sub-spaces which are both left- | 
and right-invariant. The whole problem is thus much simpler 
for [t X r] than for t itself ; its solution is obtained by carrying 
over the equation 
= xe -p xe” peo (7.5) 


for the algebra p to [t x t], the result of which is (12.14). 
Nevertheless we must return to the previous less elementary 
analysis in order to see—and this result presupposes that the 
field is algebraically closed—that each of the irreducible in- 
variant sub-spaces of [t X r] obtained in this way is equivalent 
to a sub-space of the algebra ït x t of the form 5 x p (where 
p and 9 are irreducible invariant sub-spaces of x with generating 
units e and @). 

The completely antisymmetric case can be dealt with in a 
corresponding elementary way. 

The complete reduction of the manifold Rİ of tensors in the 
9.dimensional spin space R, v = 2, is accomplished with the 
aid of the Clebsch-Gordan formula [III, (5.9)]. (c)f is ©, X © x 
.++ x ©, (f factors), where Œ, is the representation of the linear 
group ¢ = C by itself, and by the formula mentioned above this 
representation is completely reducible into the irreducible Cr, 
where v can assume only the values f, f—2,f—-—4 75 The 
dimensionality of ©, is v+ l, and to each of these possible 
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dimensionalities there corresponds here but one irreducible 
representation. Formula (12.10) then tells us that there exists 
only one term system having the multiplicity v-+1(=f+1, 
f—1, f—3,°:°-+); compare the beginning of § 15 on this point. 

The preceding analysis seems to me to be necessary in order 
to obtain a complete understanding of the relations implied by 
the permutation group without recourse to the approximation 
characteristic of the theory of perturbations. So far as the 
latter is concerned we proceed as follows. Again consider a 
term of the form (8.4) of the unperturbed system, the only 
degeneracy of which is that necessitated by the equality of 
the felectrons. The perturbation equation is then 


Fluis - + +, yi) = La(st) - F(tyky, «°°, yky), (12.15) 


where the a(s) are the exchange energies and 7, +--+ ty, Ry °° + Ry 
are obtained from 1» » - f by the permutations s, t respectively. 
Let ¢ be the tensor in spin space defined by 


F(ul, ted, * °°, yf) = Plite ss ty) ; 
the anti-symmetry of the double tensor F then tells us that 
F(tyty, a) Lyls) = Òs © SEa a tf), 
and on letting a’(s) = 8, + a(s), (12.15) becomes 
$ = a'd. (12.16) 
The problem is thus reduced to that of finding the characteristic 
numbers of this linear correspondence in the 2/-dimensional 
space Rf. 
Let %,(P) be the characteristic functions of the single electron. 
If the perturbation is due solely to the Coulomb forces between 
the various electrons, that part of the energy matrix a(i, + + + iy; 


kı : + + ky) which is due to the perturbation is obtained additively 
from terms 4 the form 


f ES ` PaP DEE AOD ay, av 
at p 


where a + 8 and the denominator is the distance between the 
two points Pa and Ps. The orthogonality of the ẹ tells us that 
this integral can be non-vanishing only if the permutation s, 
which sends the set of indices k into the set i (both of which 
are permutations of 1, 2, -- -, f), is either the identity or the 
transposition (œf). In this latter case we find 


a(s) = Eup T PaP m HPI, 
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On the right-hand side of (12.16) we then have only the terms 
arising from s = I and the transpositions s = (a8) : 


$ = {a(l) — ZE as(«f)}4. (12.17) 


Dirac has given a remarkable formula for the transposition 
acting on a spin tensor. Let ©- be the spin of the a electron ; 
Sž, S%, SZ are then the operators 

—1 1 0; 
0; 0 1 


0 I lO 
o f 

acting on the « index of the tensor Pliit tt ty). On calculating 
in particular 


? 


1 


(GS) = SIS? + Sts? 4 SIS? 


(which should perhaps be written (©! x ©3 instead, since ©} 


affects only the first index and ©? only the second), we find that 
it is the operator 


ty lg 

0 OO; 1 

1 0O — I 2 
1 21—11 


acting on the first two indices, all other places being 0. Hence 
{1 + GIG") is the substitution 


$(00) > $(00), PL1) > (I; (10) > g(01), (01) > (10) 
or the transposition of the first two indices. The energy (12.17) 
may then be written in the form 


H=E,y— 5 ZESA). (12.18) 


This may be interpreted as saying that the coupling between 


. . l 
the electrons « and f is responsible for the term — zE a5.) 
in the energy operator. However, the constant E, does not 


represent the energy of the unperturbed system.” 
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C. Expricir ALGEBRAIC CONSTRUCTION 
§ 18. Young’s Symmetry Operators 


We now supplement the general theory developed above 
by an explicit algebraic construction of the irreducible repre- 
Sentations of the symmetric permutation group 7 = ms. This 
problem is, as we know, equivalent to that of constructing the 
primitive symmetry classes of tensors of order f by means of 
idempotent symmetry operators e; here a “ primitive ” sym- 
metry class is one such that the symmetry of the tensors be- 
longing to it cannot be further increased by the addition of 
further symmetry conditions—such an additional condition 
either reproduces all the tensors of the class or reduces them all 
to 0. This construction is due to A. Young and G. Frobenius 38 ; 
with its help we are able to verify step by step the entire theory 
of representations of the symmetry group in an explicit and 
elementary manner. 

We are already acquainted with two very simple processes 
which yield tensors of maximum symmetry: “ symmetrization,” 
by means of which the tensor F yields the completely symmetric 
tensor }'sf, and “ alternation,” which sends F into Jẹ,- sF. 


8 8 
The first of these processes can be readily generalized as follows : 
We divide the range from 1 to n of the “ variables ” Tita "© * Ty, 
on which the general tensor component F(tyt, °° - ty) depends 
(or, what amounts to the same, the sub-indices | ae eee 8 
into sub-sets of lengths f, fa +; fy + Te++++=f. We then 
symmetrize with respect to the indices of each of these sub-sets. 


Pattern 7, 5, 4, 4, I. 


This distribution into sub-sets may be readily visualized with 
the aid of a “ pattern” P= Plh, fa +> *) as illustrated in the 
accompanying figure [for the pattern P(T, 5, 4, 4, 1)]; each of 
the f squares in the pattern is occupied by a different one of the 
f integers 1, 2,-- +, f Each of the sub-sets mentioned above 
constitutes a horizontal row of the pattern, and the various rows 
are arranged one under another. The individual sub-sets may 


YOUNG’S SYMMETRY OPERATORS 359 


be arranged in order of decreasing length: fi ZJ 2°-+-+; the 
pattern then consists of non-interrupted vertical columns as 
well as non-interrupted horizontal rows. Those permutations 
p which permute the members of each row among themselves 
constitute a sub-group (p) of r of order fi! fa! +- [denoted in § 8 
by a(f,, fo, °° *)|. The symmetry operator described above, and 
which is to be applied to an arbitrary tensor, is 


a= 2P; 
p 


henceforth p will always denote an arbitrary permutation which 
sends no numeral of one row into another row. 

So far we have made no use of the process of alternation. 
If after having symmetrized with the aid of the operator a we 
alternate with respect to certain of the variables or sub-indices 
1, 2,---, f, we certainly obtain 0 if any two of these numerals 
are in the same row, for the tensor obtained by the symmetriza- 
tion is symmetric with respect to any two such numerals and 
the result of subsequently alternating with respect to them must 
be 0. To avoid this situation we choose one variable in each of 
the rows and alternate with respect to them; since the order 
of the variables in each row is so far immaterial we may place 
these chosen variables in the first column. We then disregard 
the first column and proceed to alternate with respect to a set of 
variables obtained by selecting one from each row of the re- 
mainder of the pattern; these variables may now be shifted into 
the second column. This process is continued. until we have 
covered the entire pattern ; the result is that we have symmetrized 
with respect to the rows and have followed this symmetrization by 
alternation with respect to the columns. Let q denote an arbitrary 
permutation which permutes the variables in each column among 
themselves; these g constitute a certain sub-group (q) of ~. 
The alternation described above consists in applying the sym- 
metry operator 


b= Xò’ g, 
q 


and the entire process consists in applying the resultant operator 


c = ba = Xò, ' qP. 
P,@ 
We call c the Young symmetry operator belonging to the 
pattern P. | 
In order to obtain a unique symmetry operator ¢ associated 
with a given pattern P we must specify the way in which the 
numerals from 1 to are to be distributed in P: they shall be 
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introduced in such a way that on reading the pattern, as one 
would read a page of a book, they appear in their natural order 
1, 2,°-°°,f. If we write them in any other order, say that ob- 
tained from the standard form with the aid of the permutation r, 
we obtain a “ conjugate’ element c, which, as is readily seen 
on considering the relation between the tensors generated by 
these two operators, is related to c by 


Cr=re or ¢,(s) = ¢c(r7tsr). 


Hence the introduction of r results merely in a new name. 

From now on we operate with symmetry quantities, i.e. 
elements of the algebra (7), instead of tensors; we consider the 
invariant sub-space p, of r consisting of all elements of the form 
y = xê and the representation Ñ, of m induced in it by the regular 
representation. With p, is associated the symmetry class $, 
of all tensors of the form cF. If we replace c by one of its con- 
jugates c, we obtain instead of p, an equivalent invariant sub- 
space ; in this sense the order in which the variables are written 
in the pattern is quite immaterial. We hope that p, is irre- 
ducible and that the totality of representations ý, associated 
with all possible patterns constitutes a complete set of inequi- 
valent irreducible representations of m. This hope is strengthened 
by the fact that the total number of patterns is just equal 
to the number of inequivalent irreducible representations. To 
show this we note that the number of patterns is equal to the 
number of partitions of f into integral non-negative summands 
f=fı+ fat: -+ which satisfy the condition A 2f,2--° 
On writing 


fi—fe= Ty fe—fs=tn °° 


we see that this number is equal to the number of solutions of 
the equation 


Ir, + 2r, + 37, +: . -=f 


for non-negative integral r. But we have already seen that this 
is the number of classes of conjugate elements in a and, by the 
general theory, is therefore equal to the number of inequivalent 
irreducible representations of r. 

If the dimensionality n of the vector space is less than f 
the only non-vanishing symmetry classes are those arising from 
patterns containing at most n rows, for if the first column. is 
longer than x alternation with respect to the variables standing 
in it alone causes an arbitrary tensor to go over into 0. The 
only patterns which we need in this case are consequently those 
obtainable from the algebra to, instead of r, where Ty = bay as 
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defined in §2 above. The number of inequivalent irreducible 
invariant sub-spaces into which the tensor space Rf can be 
reduced is accordingly decreased to the number of partitions 
of f into n integral summands f = fa + fa +°- + fn for which 
f = fo 2° _ =fn = 9. 

A permutation $ = qp which is obtained by composition 
from a permutation p of (p) and a permutation q of (q) can be 
so obtained in only one way. This is an immediate consequence 
of the remark that the equation qp = I can be fulfilled only by 
p =l, q= l, for it asserts that p = q~' belongs to (p) as well 
as to (q). The components of the symmetry operator c can 
therefore be described as follows: c(s) = 0 unless s belongs to 
the set (q)(p) ; when s belongs to this set c(s) = +1 according 
as the unique decomposition s = qp ytelds an even or an odd 
permutation q. 

We must now prove the following three assertions con- 
cerning c: 

(1) c is essentially idempotent ; or, more precisely, c satisfies 
an equation cc = y’ c, where y is a non-vanishing numerical 
factor. Furthermore, y is an integral positive number which 
is a factor of f!. Then replacing e by e = c/y, e is idempotent. 

(2) The sub-space P, is irreducible, the e introduced in (1) is 
primitive. 

(3) Different patterns lead to inequivalent sub-spaces p.. 

The execution of this programme depends upon a simple 
combinatorial auxiliary theorem, which we now proceed to 
develop. Denote the lengths of the columns in the pattern 
P with rows of lengths fi, Ja °° by ft, fox: 


AZh2:* ff 2h" 2--:; 
Athtooc=Alth + Ef. 


We think of the pattern P as cut out of a rectangular chess- 
board consisting of f, horizontal rows and ff vertical columns, 
and the permutation s as operating on f chess-men occupying 
the f fields. On interchanging rows and columns in P we obtain 
the dual or transposed pattern P*. 

Auxiliary Theorem. A permutation s belongs to (qp) if and 
only if any two pteces originally in the same row are not sent into 
the same column by s. 

Proof. It is evident that this condition is necessary in 
order that s belong to (qp). The change of position which one 
of the pieces suffers as a result of s can be accomplished in two 
moves, a horizontal and a vertical move (in this order). It 
is at first conceivable that the horizontal move could send the 
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piece into a field of the original board which is not contained in 
the pattern P. If the decomposition s = gp is possible p must 
represent the horizontal move and gq the subsequent vertical 
one; it is clear that q and p are thus uniquely determined. 
Now if s satisfies the conditions enunciated in the above theorem 
the horizontal move can never throw them into the same column, 
i.e. the same field. It only remains to show that the horizontal 
move can never send any piece out of the pattern proper, or: 
those pieces which s sends into a column of length f* come from the 
first f* rows of the pattern. We divide the chess-board horizontally 
into an upper and a lower part, the upper consisting of the 
first f* rows. The pieces which are sent into the first column 
by s are, by assumption, from ff different rows; hence there 
are at least (and therefore exactly) ff — f* of them which come 
from the lower part of the board and not from the first f* rows. 
Note that ff — f" is exactly the number of fields in the first 
column which lie in the lower part of the board. On applying 
this argument to each column in succession we find that the 
number of pieces which s sends into those columns which pro- 
trude into the lower part of the board is exactly equal to the 
number of fields in this part of the board. Hence all the pieces 
in the lower part of the pattern are sent into columns whose 
lengths are greater than f*, and the only pieces s sends into a 
column of length f* come from the upper part of the board. 
This auxiliary theorem allows us to assert that if s does not 
belong to (gp) then there exist two pieces in a single row which 
are sent into the same column by $s. If u denotes the trans- 
position of the two pieces in their initial positions and v their 


transposition in the final then sw = vs; here u belongs to (p) 
and v to (q). 


§ 14. Irreducibility, Linear Independence, Inequival- 
ence, and Completeness 


_ We now examine the Young symmetry operators c associated 
with the various patterns. Obviously 


c(sp) = els), ¢(gs) = 8, * cls), (14.1) 


where p, q are, as usual, elements of (p), (g), respectively.” 

au par” (14.2). Any element a of (ar) which satisfies equations 
a(sp) = a(s), a(gs) = 8, - a(s), (14.3) 

1s a multiple of c. 


IRREDUCIBILITY, LINEAR INDEPENDENCE 363 
To prove this theorem we first note that (14.3) implies 


a(qp) = 84° a(l) ; 
on setting a(l) = A the equation 
a(s) = A> c(s) 


which is to be proved, is certainly correct for all group elements 
s of the form gp. We must next show that a(s) = 0 if s does 
not belong to the set (gp). Such an s implies that there exist 
transpositions u and v, lying in (p) and (q) respectively, for 
which su = vs. But then by (14.3) 


a(su) = a(s), a(vs) = 8,° a(s) = — a(s) 


} 


? 


whence a(s) = —- a(s) or a(s) = 0. 

Theorem (14.4). Every element of (m) of the form cxe is a 
multiple of c. 

It was shown in the general theory that this theorem is 
valid if c is a primitive idempotent element of (r) and if 
the field in which we operate is algebraically closed; here we 
approach it from the opposite direction, as we wish to show 
directly that it holds for c in order to prove that ¢ is primitive. 
Now obviously any element of the form xc satisfies the first of 
equations (14.3) and any element cx the second; hence any 
element of the form cxc has both properties and is consequently 
a multiple of c. 

Theorem (14.5). cc= yc and y is a positive integer which 
ts contained in f\. 

That cc is a multiple of c follows immediately from the 
previous theorem; y is therefore the number 


y= Zeie) = Fels) > els). 


Let the sub-space p, of elements of the form xé be of dimension- 
ality g. The projection 

x —> y = xé (14.6) 
projects any element x into an element lying in this sub-space 
and is, within p, itself, merely the multiplication y = yx. Its 
trace is therefore yg; to see this we need merely to adapt the 
co-ordinate system in group space to the sub-space Pe On 
the other hand its trace is immediately obtainable from (14.6) or 


y(s) = Fale; 


it is fie(l) = fl, hence 
yg = fi. 
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Consider the meaning of this fact that y is positive, i.e. that 
c(s)c(s-4) is oftener positive than negative ! 

e=c/y is idempotent; hence the character of the repre- 
sentation ), induced in P, by the regular representation is 
by (8.3) 


1 Sc(r-ts0). (14.7) 
Yr 

We obtain as a by-product the fact that the dimensionality g 
of the representation J, 1s a factor of f!. 

Theorem (14.8). P ts irreducible. 

We know already that this theorem is a consequence of (14.4), 
but it may be instructive to prove it directly as follows. Let 
e = c/y be reduced into two independent idempotent elements 
e& + é; then 


x(s) = 


el, = 8E = l whence eee = e. 


Now by theorem (14.4) any element of the form ee,e is a multiple 
of e; hence e¢, = Àe. e, = e, then yields the equation A? = A 
for the number A. Consequently either A= 1 or A= 0, ie. 
either e, = e or & = 0. 

We shall say that the pattern P’ with rows of lengths 
fu fa > < is higher than P if the first non-vanishing difference 
fi — fa te — fa +++ is positive. 

Theorem (14.9). If the pattern P' is higher than P then 
cc = 0. 

We do not here assume that the variables are written in 
the patterns P, P’ in the normal form agreed upon in the previous 
section—i.e. in which the numerals appear in their natural 
order on reading the pattern as one would a page of a book. 
The proof is based on the fact (F) that there exist two numerals 
which are in the same row in the pattern P’ and in the same 
column in the pattern P. Ifv is their transposition it belongs 
to the group (p’) associated with the rows of P’ and at the same 
time to the group (q) associated with the columns of P; hence 


c'(sv) = c'(s), clus) = — ce(s). 
On replacing vt in 
c'c(s) = Ze (st>)e(t) =: — Ze’ (st)e(ot) (14.10) 
by ¢ alone we find 
c'¢(s) = — 2ic'(stv)c(t) = — Ye (sheft) = — e'c(s). (14.11) 


(3 
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(F) is evident if the first row of P’ is already longer than the 
first row of P, for it is impossible to distribute the f’ numerals 
in the first row of P’ over different columns of P if f, < fy. 
If fi = fı and the numerals of the first row of P’ are actually 
distributed over different columns of P, we discard the first 
row of P’ and the f fields of P containing the same numerals as _ 
this row. On shifting the fields of P upward to fill in the gaps 
P is transformed into a pattern which has exactly the same 
appearance as if we discarded the first row of S; we are only 
interested in the fact that this process leaves all pieces in their 
original column. The proof can then be completed by mathe- 
matical induction—by assuming that it holds for the abbreviated 
patterns obtained by omitting the first rows of P and P’. 

Theorem (14.12). Let c, c’, ©- + be the Young symmetry 
operators associated with different patterns P, P'. -- -> the corre- 
sponding sub-spaces Pe, Po, ++ + are then linearly independent. 

Let the P, P’, P,- + be arranged in such an order that 
P is higher than P’, P’ higher than P”,---. An element x of 
P = Pe is reproduced by right-multiplication with ĉ/y but, by 
the previous theorem, this process transforms all elements 
x’ of p’, x” of p’’, +++ into 0. Assume there exists such a linear 
dependence | 


ete paw’ +++ += 0; 


on right-multiplication with ¢ we find x = 0 and consequently 
x +e’ ++++=0. The theorem is thus reduced to the 
same theorem for the smaller set P’, P”, and the proof 
follows by mathematical induction. 

Theorem (14.13). Different patterns P, P’ give rise to in- 
equivalent sub-spaces Pe, Per. : 

The proof is accomplished by a direct derivation of the 
orthogonality relations. Let P’ be higher than P. Since we 
did not assume in proving theorem (14.9) that the numerals 
were distributed in the same order in the two patterns P and P’, 
we may replace the element c with components c(s) by the 
“ conjugate ” element c,-1 with components c(rsr7}) : 


De (st *)c(rtiy-}) = 0. 
t 
Summation with respect to 7 yields 
Ee'(st) + xeli) = 0. 
On writing y = Xe x’ = xv this formula is equivalent to 
Exx) = 0. 
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In particular 


Zx x) = 0. 


If the two sub-spaces were equivalent we would have x’(t) = x(t), 


and since x(t!) = x(t) for the symmetric group the above 
equation would yield 
Zx (s) = 0. 


But this is impossible, for by (14.7) the character x(s) has 
rational components, and in particular x(l) = g + 0. 

This last conclusion is valid only if the number field in which 
we operate is non-modular ; naturally this restriction is irrelevant 
for physics. Nevertheless it constitutes a blemish which should 
be removed, for the remainder of our deductions only introduce 
the minimum assumption that f! is not 0 in the field under 
consideration, Now from the general theory we know that 


Theorem (14.14). Sx(s)x(s 3} = f!. 


The blemish mentioned above is removed by proving this 
theorem directly. We must show that 


Zx) * e(s) = 1 
or 
S efr ir hels) = 1. 


On replacing the summation variable s by sr, where r is fixed, 
this becomes 


Ze(lsrje(s r} = 1. (14:15) 
r, 8 
Consider next the function 
a(s, S) = Se(sr)e(s’r-}) ; 
r 
as a function of s it satisfies the second condition in (14.3). 
But the first of these conditions is also satisfied, as can be seen 
immediately by replacing y in 
a(sp, s’) = Le(spr)e(s'r) 
f 
by the summation variable p~1r. Hence by (14.2) 


als, °) = cls) + Ze(re(sr-3) = efs) + els) = “els)e(s’) 
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and therefore the left-hand side of (14:15) or 


Fals, s) = 2 Se(sjo(s- 


is actually equal to 1. 
The relations 

ZX(S)x'(S™) = 0 or f! (14.16) 
show that the primitive characters obtained by our construction 
from the various symmetry patterns are linearly independent, 
and since their number is equal to the number of classes of 
conjugates in the group 7, any class function can be represented 
as a linear combination of the x(s). In particular, the function 


1(s), which is 1 for s =| and otherwise 0, must possess such 
an expansion : 


fle Ls) = mx(s) + m’x'(s) be e+. (14.17) 


Multiplying by x(s7+) and summing over s we obtain, with the 
aid of the orthogonality relations (14.16), the equation 


F\x(l) =flm 


or 


| m= g | (14.18) 


for m. Since 


x(s) = Zelrsr™) = Zels), 


equation (14.17) gives the reduction of the modulus 1 into 
primitive idempotent elements e, Hence the regular repre- 
sentation is reduced into the irreducible representations e 
associated with the various symmetry patterns. Since f! 1(s) 
is the character of the regular representation, eq. (14.18) is a 
direct verification of the fact—proved in the general theory—that 
the number of times each irreducible representation appears 
in the regular representation is equal to its dimensionality. 
This completes our direct and elementary development of the 
theory of the representations of the symmetric group. 

The method of proof employed in establishing theorem (14.9), 
ie. that cc’ = 0 if P’ is lower than P, will now be used to answer 
another question. Let a be the operator, introduced in the 
previous section, which symmetrizes with respect to the ciphers 
occupying the rows of P: 


a(s) = 1 or 0 according as s belongs to (p) or not, 
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and let the numerals be written in the pattern P’, which is 
lower than P, in an arbitrary order. I assert that ac’ = 0, 
There exist two numerals which occupy the same row in P 
and the same column in P’. If v is the transposition of these 
two numerals then 


a(sv) = a(s), c'(us) = — e'(s), 


and the assertion is proved with the aid of (14.10), (14.11) o 
replacing c’, c there by a, c’. Hence also 


Salsi!) (rtr!) = 0, 
t 


2a a(st-4)y'(t) = 0 or Sar “ly (rs) == 0, 


That is, the sum of the x’(t) extended over all elements ¢ = rs 
which are left-equivalent to s mod. (p) [i.e. r in (p)], is zero. 
In particular, Xx (s) = 0, where the sum is extended over all 


elements s of (p); x’ is the character associated with a pattern 
P’ which is lower than P. On applying this result to the con- 
siderations of § 8 (in particular, to (8-13) ff.) we find: 

If the individual I has the simple energy levels Fy, Er ++ * the 
term 


fifi + fE t> Azher hAtht: - om f) 


of the unperturbed system I’ appears only in those symmetry 
classes of tensors whose pattern P’ = P(fy’, fy - + +) ts not lower 
than P = Pif `’) 

Thus we saw in discussing the two-electron problem that 
terms of the form E, + E, appeared in the “ anti-symmetric ” 
as well as the “ symmetric ° term systems, whereas terms such 
as 2E, appeared only in the latter. 

Finally, we consider the relations existing between two 
dual patterns P and P* with generators c, c* and characters 
x, x*. The group (p) which permutes the members of cach 
row of P among themselves coincides with the group (q*) which 
permutes the members in each column of P* among themselves ; 
similarly (q) = (p*). If s= qp is in (gp), then s~! = pulg-) == 
q*p* is in (q*p*), and conversely ; for such an element 


c(s) = 84, c*(s“) = 56 = 5,. 


Hence in general-—even when s is not in (gp) and, consequently, 
s~! is not in (q*p*)—we have 


c*(s“) = à, + ce(s). 
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tt 1) ` 
Dual” elements c, c* are therefore related to each other in 


exactly the same way as the “duals” i . 
Further y s” introduced in § 12. 


ye =y; X5 (st) = x*(s) = 85+ x(S); g* =g. 


If P is higher than Q, then conversely P* is lower than Ọ*. 
For if we lower P by taking away the last field of one of the 
rows of P and adding it to the end of a later (shorter) row, one 
of the columns of P is increased at the expense of a later (shorter) 
column; by such a process of shifting individual fields, in which 


no gap 1s to occur in a row or a column, P can be transformed 
into the lower pattern Q. 


§ 15. Spin and Valence. Group-theoretic Classification 
of Atomic Spectra 


If the vector space R = R, is only 2-dimensional, the only 
symmetry patterns which give rise to primitive symmetry 
classes of tensors of order f are those which consist of at most 
two rows. Let the first row contain l -+ v fields and the second 
I; then 

v= f — 2l. 


The symmetry pattern P is thus uniquely characterized by the 
number v, which we call its valence, and v may assume any of 
the values f, f — 2, f — 4, +. Let $e be the totality of tensors 
of the form cF obtained by applying the Young symmetry 
operator ¢ associated with the pattern P to the totality of tensors 
F, and let §, be the representation of the linear group, the 
substratum of which is the tensor manifold $, A sufficiently 
general tensor of order f which is symmetric in the first as well 
as the second rows of indices is given by 


CXEX' XEXE (1 + v terms) 
xypx O&K KY (l terms), 
where 
E = (Xy, X2)» Y = (Yı Y2) 


are two arbitrary vectors. On alternating with respect to the 
columns we find that the representation É. of the linear group 
¢ = ¢, is that one which is induced on the quantities 


(XY — X1) XT we (ryt t2 = 2)- 


Hence 9, is the representation of the linear group which was 
denoted in III, § 5, by G. 
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This remark supplies the connection with the symmetry 
problem of quantum mechanics as dealt with in § 12—on apply- 
ing the Pauli exclusion principle when the existence of the spin, 
but not its dynamical effect, is taken into account.'* Since 
the spin space is 2-dimensional, formula (12.10) tells us that 
the only patterns P which give rise to a term system are those 
whose duals P* consist of at most two rows, i.e. those P which 
themselves have but two columns. If v is now the number of 
fields by which the first column of P exceeds the second we call 
v the valence of the term system or of the corresponding state of 
the atom. The multiplicity of the term system with valence 
v is v+ 1l, and to each of these possible multiplicities corre- 
sponds but one term system as we have already scen in § 12 
(in particular p. 356). We previously (Chap. IV) called s = v/2 
the “ spin quantum number.” 

The fact that the longest column of P cannot exceed the 
dimensionality N of the vector space R, associated with the 
electron translation may result in a further restriction on the 
possible symmetry patterns P. This situation cannot arise 
as long as we deal with the total oo-dimensional system space. 
On the other hand if we restrict ourselves, for example, to those 
states of the electron which are characterized by a fixed principal 
quantum number n and a fixed azimuthal quantum number l 
—and which therefore constitute a (2/-+ 1)-dimensional sub- 
space R(nl) within Re—i.e. if we consider only those states of 
the atom in which all the f electrons outside a closed core are 
in R(nl), the dimensionality N is reduced to 271+ 1. Then f 
cannot exceed 2(2/ -+ 1) and the possible valences of the states 
under consideration are given by the following table : 


f=l1, % 3, 4 scree / 4 4lt+1, 42+2 
101 0 «ere: 0 1 0 
v 2 83 2 veeee. 2 
4 


This table again gives us the alternation law, but shows that in 
addition the number of possibilities decreases from the middle 
of the table on. The possible multiplet numbers 2s + 1 of 
terms in these states is one greater than v. 

This “valence ” v, which describes the symmetry state of 
the system, is actually the chemical valence, as was shown by 
F. London.'° We allow two atoms, consisting of fı, fẹ electrons 
respectively, to come togethér to form a molecule with f = f, + f 
electrons. Let P, Pa be irreducible invariant sub-spaces of 
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the system spaces Rf, Rf: respectively. In order to find which 
symmetry states the molecule is capable of assuming when the 
first atom is in the state $, and the second in $, we must com- 
pletely reduce the space P, x P into its irreducible constituents. 
If we consider this decomposition as taking place in the vector 
space of electron spin rather than in that of electron trans- 
lation (the justification for which will be given below), the 
problem is solved by the Clebsch-Gordan series (III, 5.9); it 
tells us that if the valences of the symmetry states of the two 


atoms are vy, Vg the resulting symmetry states of the molecule 
are those with valences 


zs Vy -+ Vo, Vy -+ Vo — 2, Vy + Us — 4, nN |v, — Val. (15.1) 


This situation can be readily visualized in terms of the symmetry 
patterns as follows. Bring the two symmetry patterns P,, Ps 
of the two atoms into the positions shown in 
the accompanying diagram and then shove 2 
vertically upwards, one field at a time, until one 
of the two columns of the combined pattern is 
closed; each of these steps represents a possible 
symmetry pattern for the molecule, in which v is 
the number of fields which are not paired hori- 
zontally. The saturation of the valence bonds 
here appears as the pairing of fields or, more physi- 
cally, as the saturation of the spin of an electron 
in one of the atoms with that of an electron in the . 2 
other. The empirical theory of the valence bond |—|— 
has therefore a rather profound significance. —|— 
We have yet to justify our use of spin space 
rather than translation space in the above. Let the representa- 
tion of the permutation group my corresponding to the two- 
columned symmetry pattern of valence v be denoted by 9, ; its 
dual > consists of but two rows. The Clebsch-Gordan series, 
together with the third reciprocity theorem of § 10 as applied to 
the linear group ¢ = ¢y, tells us that on restricting m to the sub- 
group a’ = m, X T which permutes the electrons of each atom 
separately the representation ý, of m contains the irreducible 
representation h, X h», of 7’ once or not at all, according as 
u is one of the values (15.1) or not. From this it follows im- 
mediately that the same result holds for the duals on reducing 
h, after restricting m to m’. Applying the same reciprocity 
theorem in the opposite direction for the case in which ¢= C, 
is the linear group in n dimensions, we find that the representa- 
tion Ha X Dua of ¢ (or the algebra 2) contains the representation 
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,» once or not at all according as v is one of the values (15.1) or 
not. On reducing Qa X O» into its irreducible constituents 
we may expect to find other representations—which may even 
occur more than once—in addition to these simple 0,, but these 
additional representations will correspond to symmetry patterns 
with more than two columns and are, in virtue of the Pauli 
exclusion principle, of no importance for physics. The number 
b introduced in § 11 is accordingly at most equal to 1 in the case 
of diatomic molecules. 

Molecules which consist of a larger number of atoms can 
be studied by the same method. If in particular we are in- 
terested in the case of three atoms and their valences are v4, Us Us, 
we can determine with the aid of the Clebsch-Gordan series 
the number b, of times the representation ©, occurs in the 
reduction of @,, x ©,, x @,, Those v for which b, =+ 0 are 
the valences of the possible symmetry states of the molecule 
and b = b, (which may here be greater than 1) are the corre- 
sponding multiplicities. The characterization of the quantum 
and symmetry states of a molecule which is formed by the 
union of three atoms in given quantum and symmetry states 
requires, in addition to the valence v, a further index which 
distinguishes between the various bẹ, possible energy levels. 
But this description of the various possibilities differs from the 
empirical theory of the valence bond—the manifold of possible 
bindings is smaller,?° 


Classification of Spectral Terms. 


Let the unitary or the complete linear group ¢,,, in the system 
space R of the single electron be restricted to the group ¢, X Ca 
of transformations $, X S,, the two factors of which are trans- 
formations of the spin and translation spaces R,, R, respectively : 
R= VR, xX Rn The space {MR} of anti-symmetric tensors of 
order f is then reducible into irreducible invariant sub-spaces 
with respect to the algebra of symmetric transformations of 
the form (12.2). We thus obtain a distribution (I) of spectral 
terms among the various symmetry classes; this step is of 
universal validity and is applicable to molecules as well as 
atoms, | 

The further classification of terms, as discussed in Chapter IV, 
A, refers to “simple’’ rather than ‘‘ quantum ” states, ie. to 
those states which are related to spatial rotation and moment 
of momentum in the same way that the quantum states are 
related to displacement in time and energy. Naturally this 
application of the rotation group D = b, (the elements of which 
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we now denote by a, 7, « - -) is significant only for atoms (or 
ions), the nuclei of which are considered as fixed centres of 
force. So long as we concern ourselves only with the electron 
translation and neglect the mutual perturbations of the electrons, 
which are characterized by principal and azimuthal quantum 
numbers n and l, each individual term of the system is char- 
acterized by the quantum numbers (n4, h; Mna lg; +++; my, l). 
The number of times such a term appears in a given symmetry 
system is equal to the dimensionality of the linear sub-space 
in which the atomic states under consideration lie. The resolu- 
tion caused by the mutual perturbations parallels the reduction 
of this sub-space into its irreducible constituents fiz with respect 
to the group D of rotations; the resulting components of the 
term have the natural multiplicities 2L + 1. The spin space is 
similarly to be reduced. Let bd induce the representations 
b, :o -> U(o) and ©: e — V (o) in R, and R, respectively. This 
second step (II), in which the spin and translation spaces are con- 
sidered separately, is interpreted from the stand-point of group 
theory as meaning that we associate with the element (os, 7) 
of bD xX d the transformation U(a) x V(r); we thus obtain a 
6-parameter sub-group of ¢, X Cy, and on restricting ¢, X Cn to 
this sub-group our original irreducible sub-space is further 
completely reducible into irreducible constituents. The irre- 
ducible representation of D x D induced in such a sub-space is 
of the type D, X D. The final step, (III), consists in introducing 
the coupling ø =r: the 6-parameter sub-group is thereby 
restricted to a 3-parameter sub-group, i.e. that sub-group 
induced in the total system space by the rotations ð. The 
spin perturbation then resolves each such term multiplet into 
its (at most 2s + 1) components : 


DX D= FB; Gel+slts— loos, (I — sl); 


naturally Ð, X D. is here a representation of b instead of D x D. 

Actually v = 2, and the transformations induced in the 
spin space Jt, by the rotation group constitute the unitary group 
in two dimensions, Consequently the transition from ¢, to }, 
in step (II) involves no reduction in spin space—this is the 
essential simplification caused by the fact that R, has so small 
a dimensionality. 

To the symmetry system of terms corresponds a certain 
irreducible representation of the unitary group u in the space 
R, of the electron translation and with it a certain irreducible 
characteristic (§ 9) 


X = X(e1, Eg °° *). 
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The co-ordinates v; in the space R, are broken up into classes 
in the manner described in Chapter IV, § 1: 


x(m) [m=1,1—-1,--+—]]; 
x'(m) [m =- —1, +--+ — i]; 

Each of these classes describes a (2/ + 1)-dimensional sub-space 
R(nl) of R, in which the group bd, of spatial rotations induces 
the irreducible representation ®;, and is characterized by the 
principal quantum number v and the azimuthal quantum number 
l. The arguments e, of X are correspondingly broken up into 
classes. To give the principal and azimuthal quantum numbers 
of the individual electrons—without stating how these numbers 
are distributed among the f electrons—we need only to state 
how many (f’) electrons are represented by states in each of 
the various sub-spaces R' = Ril). If, for example, 3 of the 
electrons are in R’ and the remaining 5 in R” (f= 8) we must 
separate out that part of X which is of degree 3 in the variables 
c; belonging to R’ and of degree 5 in those belonging to R”. 
The multiplicity M of the corresponding term 


E(ml,) + E(nd) +: ° 7 + E(nyl,) 


of the “unperturbed ” atom in the symmetry . system under 
consideration is then obtained from the part of X described 
above by setting all e contained in it equal to unity. In order 
to determine how this M-fold term is broken up on taking the 
mutual influence of the electrons into account we replace the 
variables e(m) of the class R(nl) by e(m) = e”, the variables 
e’(m') of the class R(n’e’) by e'(m’) = e™ (with the same 8), etc. 
The resulting expression must be a linear combination of the 
sums 


+D elti —g-—2I 


with non-negative integral coefficients. This enables us to 
tell which of the various total azimuthal quantum numbers L 
appear, and how often, in the resolution of the above term ; 
each such L-term has still the multiplicity 2L + 1. 

Example. We consider, as an example, the case in which 
f= 3 and all three electrons are in the same sub-space R(n). 
The possible symmetry patterns are 


-H -H LLLI 


Ll 
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The Pauli exclusion principle allows only the first two; their 
valences are v = 3 and v = 1, and the corresponding terms are 
therefore quadruplets and doublets, respectively. The first 
pattern defines the anti-symmetric tensors of order 3 and the 
third the symmetric tensors. The corresponding characteristics 
are therefore 


Xi = Sy= Š EEEn Xg = X Eijer 
i<j<k isjsSk 


On introducing 
— 2 _ 3 
S= LES; S53 = LG; 
tj 


we have X, = s1 + $+ 53. The dimensionalities of the re- 
presentations of m, corresponding to these three patterns, and 
therefore the numbers of times the representations X,, Xa X, 
of ¢ appear in (c)*, are easily shown to be 1, 2, 1, (in accordance 
with the equation 3! = 1? + 2?-+ 13. Now the characteristic 
of the representation (c)® of ¢ is 


t= (Ze? = S; + 3s, + 65,; (15.2) 
the equation 
th = X, + 2X, + X; = (25, + S2 + S3) + 2X, 
then allows us to conclude that 
Xo = Sp + 2s}. 


We prefer to carry out the evaluation with the aid of the sums 
of powers 


2 3. 
th, t= NE Die, b = DG ; 
i i i 
we then have 
lo = S3 F Sq, ty = S3 


in addition to (15.2). Consequently the characteristics in 
which we are interested are : 


Doublets : X, = als -~ tg), : (15.3) 


Quadruplets ` X, — E me ts) — (te -= t) |. (15.4) 


The solution of the problem discussed above is now obtained 
by replacing the 2/ + 1 variables ¢; by the set 


l ll * 6 œ —t 
et, gh}, 8 
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and then expressing t.f % as a sum 2a@,(L) of expressions of 


the form 
(L): epen +. . . -4 e77 


with integral multiplicities az. The computation is considerably 
simplified by multiplying both sides of the equation by e =) 
as (L) then becomes e4+1— e74, The multiplicities so obtained 
are given in the following tables : 


| t, | L= 3l, 3l — 1, 3 — 2,» l ia 0) 2a 
be a ] 2, 3, — 1,3, 5,-:° 
a a (increasing by x each step) (increasing by 2 
each step). 
re L = 31, 31— 1, 31—2, 3l — 3,- + + 0 
Multiplicity : l, 0, l, Oe ws 
(alternately 1 and o) 
Lells ey ee ey ee 0 
L, we 1, ’ a 1, AE 
(alternately r and — r) 
[a L= 3i, 3i1—1, 3! — 2, l8 i 31-5, - + + 
Multiplicity . I, — 1, 0, l Ak 0, f 


(repetition with period 3) 


On applying these results to the computation of X}, X, with the 
aid of (15.3) and (15.4) we find that the number of terms with 


total azimuthal quantum number L is as given in the following 
tables : 


Doublet System 


(1 L=0, 1, 2, | 3, 4, 5, 
O12 )/234 


up to L=. The period is here 3; the multiplicities in the 
second period are those of the first increased by 2, those in the 
third are obtained from those in the second by adding 2, etc. 


a E=3i, 3-1, 31-2 | 8—3, 31-4 31-5, | --- 
ò 1 1 | 1 2 2 


down to L =}. The periodicity is again 3, but the multiplicities 
in each period are obtained from those in the previous one by 
adding 1 instead of 2. 
Quadruplet Sysiem. The periodicity is here 6 instead of 3. 
(1) For the values of Z. from 0 to | the first period of multi- 
plicities (L = 0, 1, 2, 3, 4, 5) is for even Z: 010212 and for 


odd/: 101121. The multiplicities increase by 2 from period 
to period. 
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(2) For values of L from 3l down to l the first period is 
000101 regardless of whether L is odd or even, and the 
multiplicities are increased by 1 from period to period. 


§ 16. Determination of the Primitive Characters of 
wand r 


The guiding principle in the whole of the present chapter 
is the reciprocity between the symmetric permutation group my 
and the algebra X of symmetric transformations. But this 
latter can, as was shown in §1, be replaced by the special 
symmetric transformations induced in tensor space by the linear 
transformations of vector space and which constitute a group 
(c)’ isomorphic with the linear group c. Indeed, we may even 
restrict c to the unitary group u. The algebra 2 is thereby 
referred to a group—not to a finite group, it is true, but to a 
closed continuous group. Now we have seen in Chapter II] 
that we may expect such groups to behave in a manner entirely 
analogous to that met in dealing with finite groups, at least 
if we concern ourselves only with unitary representations. As 
a rule we find in mathematics that the continuum is more easily 
handled than a discrete manifold; the formula (9.11), which 
expresses the fundamental reciprocity mentioned above, will 
therefore better serve to compute y from X than the converse. 

We therefore next evaluate the characteristics X of the 
continuous irreducible unitary representations of the »-dimen- 
sional unitary group u by a direct method which is independent 
of our previous development. The case n =: 1 has already been 
solved in III, §8; the procedure there developed serves as 
a model for the present case. With this in mind we first prove 
the following auxiliary theorem : 

A continuous function f(w,, ws, ` * *, wp) of absolute value 
l which possesses the period 27 in each of the 2 real arguments 
and which satisfies the functional equation 


Kw + w) = RoE) 
is necessarily of the form 
flo)) = efhwi + haws +> $e + hnwn), 


where the constants / are integers. 
On introducing the functions 


flw) = flw, 0,0,°° 5, 0), fol) = (0, w,O0,° °°, 0), " 


TOARE ma Frente A T S ae BASEEN pR AE AA, 
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of one variable, we are able to conclude from the functional 
equation above that 


f(y, we, °° *) = filo flw) °° - 


It therefore suffices to prove the theorem for functions f(w) of 
one variable, and this we have already done [III, § 8]. 

Every element S of the group u is conjugate to a “‘ principal ” 
element E, i.e. to a transformation of the form 


L> Ek, (v= 1, 2,-- +, 2). (16.1) 


The numbers £, are of unit modulus and may therefore be ex- 
pressed as l 
e, = e" = e(w,) 

in terms of the ‘ angles.of rotation ” w,, wa * * +, œn (which are 
only determined mod. 27) of the unitary transformation S. 
In order to employ the orthogonality relations it is necessary 
to determine the volume dS of that portion of the group mani- 
fold u whose elements have angles between w, and w, + day. 
Gy, Gy, °° *, a, being any n numbers, let D(a, as, * * -, an) denote 
the product 

IT (a; — a,) = | ar}, rea, 1 | 

i<k 
of differences; the n rows of the determinant on the right are 
obtained by replacing a successively by a, a, °° *, a, The 
evaluation of the volume element dS will be carried out in the 
following section ; we here anticipate the result 


dS = A A dw, dw, +++ dwn, A= De, Ez, © © *, En). (16.2) 


The determination of the primitive characteristics of ut is 
accomplished by combining the following important facts.21 

1. Symmetry.—Each element S of u is conjugate to a prin- 
cipal element E, (16.1). Hence it suffices to determine the 
characteristic X of a continuous representation of u for such 
a principal element. FE goes over into a conjugate transforma- 
tion within u on permuting the ¢,: hence X is a continuous 


symmetric function of the angles w, and is of period 2a in each 
of them. 


2. Arithmetic Properties ——The principal elements constitute - 


an Abelian sub-group of u; on compounding two such elements 
E, E’ the angles w, w, are added. The normal co-ordinates 
Yx in representation space % can therefore be chosen in such a 
way that the principal elements correspond to principal trans- 
formations 


E: Yr Pedr 5 
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indeed, we have shown in I, § 5, that any commutative system 
of unitary correspondences can be brought simultaneously into 
diagonal form. On compounding two principal elements the 
condition that E be a representation is expressed by the functional 
equation. 


p(w, We, * ° Apelor Wo, 7 -) = p(w + W, wa + We, 7° ) 


for each of the multipliers p = p» The auxiliary theorem then 
tells us that each px is of the form 


e(hyw + . + hawn), 


where the constants A are integers.. The characteristic of the 
representation is the sum of these px ; hence X is a finite Fourier 
series in the arguments w wiih integral non-negative coefficients. 
The “weights” of a representation are the sets of exponents 
(hy, he, + + *, Mn) Of each term 


e(hywy + howe te °° + h,w,) = għ ele gin 


which actually appears in X. The term (hy, ha, *-'; h,) is said 
to be ‘‘ higher’’ than (hi, he, °° *, ha) if the first non-vanishing 
difference h, — hy, he — ho, °° * is positive. 

3. Orthogonality.—For all primitive characteristics X the 
integral 


Qn Qn — 
fe [XXAAdw, ++ > dwn 
0 0 
must have the value 
27 Qn 
V = f- [AT do: > > dwn (16.3) 
0 0 


These orthogonality relations suggest that we introduce the 
quantities é= A+ X in place of the characteristics X; they 
are also finite Fourier series, but they are anti-symmetric functions 
of the angles w instead of symmetric ones. hy, he, © * *; An being 
integers arranged in decreasing order 


hy > hy >: + >In, (16.4) 
we construct the “ elemental sum ”’ 


Elh ha °° *; hn) = Z t elw + hawa to t hawn), (16.5) 


i.e. the alternating sum over the permutations of the arguments 
w: the term which we have written down is the highest one 
in the sum. Every alternating Fourier series is a linear aggregate 
of such elemental sums; since the coefficients of these sums are 
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integers, and in particular that of the ‘“‘ highest” term is l, 
every alternating Fourier series, such as é, with integral co- 
efficients can be expressed as a linear aggregate of the form 


E = c Elly, ha +) tes Elly, hot +) + (16.6) 


with integral coefficients c, c’, +: °. Let this expansion be 
arranged in decreasing order, i.e. in such a way that the set 
(hy, ha * * *) of exponents is higher than (ki, hg, : °°), etc.; 
hy, ha, °° *) is then the highest term in é. A is itself an elemental 
sum, namely 


A= &n—1,”—2,+-+-, 1, 0). 
Hence if the highest term in X has exponents f, fa, + °, we have 
hy =f, + (n — 1), my ha = frit 1, hn = fn; (16.7) 


in the following the numbers f; and h; are always in the relation 
(16.7) with one another. 


We denote integration with respect to all the angles of 
rotation from 0 to 2m by a single integral sign and write dw 
for dw,dw,:+:dw,. We now calculate 


SEO fay «+ JE, ha «deo 


the % and the k’ are arranged in decreasing order in accordance 
with (16.4). Consequently no permutation of the h can coincide 
with a permutation of the h’ unless 


hy = hy, he = hə, ne) hn = h, ; (16.8) 
the integral of each of the (z ! )* terms in the product 
E(hy, he, as °) E(hi, he, o ) 


is therefore 0 unless (16.8) holds. In this latter case those 7! 
terms, for which the permutation of the h is the same as that 


of the h', each contribute (2:)" to the integral and all others 
contribute 0; hence 


| Eln, ha © © Elki hg, © © dw = f (27)” 


according as (16.8) holds or not. Applying this in particular 
to the elemental sum A, we find 
JA Ado = V = n! (2n)r. 
On setting the expansion (16.6) in the equation 
| dew =y 
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we find |c|? + |c’|?+---= 1. Since the c, c’, <- are non- 
vanishing integers only the first term can appear in (16.6), and 
we must have c == 1 or — 1, and since the coefficient of the 
highest term of € (as of X) must be positive we are restricted to 
the first alternative c= 1. We have thus shown that every 
primitive characteristic is of the form 


Elha, hy, > -) ee gh, ss + gin | 
X= a. can = eed) (16.9) 


where the h, are integers arranged in decreasing order : hy >h, >». 
The function defined by (16.9) is a finite Fourier series with 
the highest term (fı, fo, °° +, fn) ; the coefficient of this term, its 
multiplicity, is 1. ' 

4. Completeness.—The last question to be answered asks 
whether every function of the form (16.9) is conversely the 
characteristic of some irreducible representation of u or not. 
Our explicit algebraic construction allows us to answer this 
question in the affirmative. To show this we first note that the 
representation of order f arising from the symmetry pattern 
with (at most n) rows of lengths fı, fo, ** +, fa has as highest 
weight (fi, Ja * °°, Ja); this can be seen immediately by con- 
sidering the representation as generated by alternation from 
the product of n vectors, the first of which occurs f, times as 
a factor, the second fy, etc. (as in the simple case at the beginning 
of § 15). The f are here any integers satisfying the conditions 


h2h2:::2fnZ9. 


On dividing the transformation corresponding to the arbitrary 
element S of u in this representation by the J power of the 
determinant of S (l being any fixed non-negative integer) the 
highest weight of the resulting transformation is (f — l, 
Je—l,+*++,fn—l); this simple device thus enables us to dis- 
pense with the restriction fa 20. We have thus proved that 
all irreducible unitary representations of the unitary group un 
are obtainable by completely reducing the representations (u)! for 
f= 0, 1, 2, < > - into their irreducible constituents and multiplying 
by the 1-dimensional representations 


S > (det. S)’ [l => 0, + L, + 2, 7 ‘|. 


We have further shown that the characteristic of the irreducible 
representation H = (fy, fo, ++ *, fn) of order f of u, which ts gener- 
ated by the symmetry pattern P(fi, fo, +++, fn), is given by equation 
(16.9). 
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We could also have obtained this last result with the more 
transcendental method of proof employed in steps 1 to 3. If 
we are operating in the continuum of all complex numbers © 
' rather than an arbitrary field the proof of the completeness of 
the irreducible representations of a finite group can be formulated 
in such a way that it can be taken over immediately for the case 
of a closed continuous group with the aid of the theory of integral 
equations. The particular application of this general group- 
theoretic completeness theorem to the group Ð, of rotations of 
a circle into itself yields the completeness of the Fourier orthog- 
onal system ert (m = 0, +1, +2,- -). Its application to 
the closed group wu, yields the following two facts: (1) Every 
expression of the form (16.9) is in fact a primitive characteristic. 
For if it were not it would be a non-vanishing function of position 
on the group manifold—in fact, a class function—whose Fourier 
coefficient with respect to each irreducible representation 
vanishes ; it is indeed orthogonal to all other functions of the 
form (16.9). (2) We further find that the functions (16.9) 
constitute a complete set of orthogonal functions for symmetric 
periodic functions of w1, w2, ***, @_; this result is of no particular 
interest, as it is a consequence of the completeness of Fourier’s 
orthogonal system in one dimension. Our general considerations 
(1) to (4) yielded so many properties of primitive characteristics 
that we were able to obtain an explicit expression for them from 
these properties alone. 

Consequences.—The assumption that k, = fn 2 0 constitutes 
no actual restriction; the characteristic is then a symmetric 
rational integral function of the ¢ of order f. The e are in fact 
roots of the characteristic polynomial f(r) = det (71 — S) of 
the unitary transformation S; it is therefore possible to express 
X rationally and integrally in terms of the coefficients of this 
polynomial, and therefore in terms of the coefficients of the 
matrix S. The restriction to the unitary group can then readily 
be removed, but we shall not go further into these considerations 
here. ?? 

The dimensionality of the representation X is found by 
calculating X for the unit element, all of whose characteristic 
numbers £, are 1. On substituting directly in (16.9) we obtain 
the indeterminate form 0/0, so we proceed as follows. Take 


w, = (n — 1)w, we = (n — la, © > +, wp = Ow 
in terms of the single anglew. The determinant in the numerator 


of (16.9) is then the alternating sum of the terms obtained from 
the product 


e(hy(n — 1)w) + e(he(m — 2)w) - > - e(h,0w) 
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by permutations of the numbers n — 1, n — 2,-++ 0; it is 
therefore equal to 


felha), + + +, {e(hw)}, 1| 


or to the product of the differences of the expressions e(h,w), 
e(hgw), +++ obtained by subtracting any member of the set from 
any of the earlier members. On allowing w > 0 we have 


e(hyw) — e(hyw) ~ iw(hy — hg). 


The dimensionality NV of the representation denoted by 
Of, Ja °° fa) in the above is consequently 


D(h,, hs, ` É hn) 


V=Din—1, 0) 


(16.10) 


Evaluation of the Characters of ws—Having obtained explicit 
expressions for the characteristics of the representations of u, 
we now employ the connection between the representations of 
ay and u, developed in § 9 to evaluate the primitive characters 
of wy. In equation (9.12) x is the character and X the char- 
acteristic of the irreducible representations of m, and Uy, re- 
spectively, generated by the symmetry pattern P(f Ja - > +); 
in particular we must put X = 0 if the pattern has more than 
n rows. The sum is extended over all possible symmetry 
patterns P with f fields. The expression (16.9) for X then allows 
us to enunciate the following rule for the evaluation of y: Let 


Xfite rar (1429 . -) (16.11) 


denote the value of the character of the irreducible representation 
H(A, Ja © © °) of ay, which is generated by the symmetry pattern 

1 Ja °**), for an element s belonging to the class f= (izt -- °). 
Choose an arbitrary positive integer n and construct the sums 
01, %, °* + of powers of n independent variables €, €x, * ` *, En and 
the product D(&, &, © * *, En) of their differences. The term (16.11) 
is then the coefficient of the term ebekt--- e™ [hi = fe + (n — i)] 
in the expansion of 


Dle, Eg ° © t, En) OP Ox + +. (16.12) 


We here assume that the pattern P has at most n rows; hence 
if we wish to obtain all primitive characters of my we must choose 
n=f. The rule shows that the components of the characters 
are integers, 

This result was obtained. by Frobenius in a purely algebraic 
manner, without introducing the continuous group u.7* But 
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I believe that the real reason for the rule comes to light only 
when we consider this connection between the groups m, and 
U,—in particular, it enables us to understand why a second 
integer in addition to fis involved. 

The dimensionality g of (fi, Ja -° +) is obtained by substitut- 
ing the argument s=I1; h =f, t= t= '' += 0 in the 
character y. Formula (9.12) is then 


of = XgX, 


where the sum is extended over all patterns P(/,, f2,°* +). Since 
o, is the characteristic of the n-dimensional representation 
c: S—>S of the group u by itself, this merely means that in 
the complete reduction of (c)f the irreducible representation 
© = O(f, Ja ` * -) appears exactly g times, as we already know. 
On substituting the explicit expression (16.9) for X we obtain 


ol: en e. E, 1 | = Se: e’, ght To e™|, 


g is accordingly equal to the coefficient of cP e} >>- eh in the 
expansion of the product on the left-hand side. The term 
+ ek. |- eh in the expansion of the determinant must 
be multiplied by the term 


! 
Í E glia ghs—hs ee 


(hi — ki)! (hy — Be)! 


of of in order to obtain a contribution to the term efet...» 
of the product. (kı, ke, > + *, kp) here run through the per- 
mutations of 7 — 1l, +», 1, 0 and g is accordingly equal to the 
alternating sum 


l 
| eee eee ese ee 
AS, = (hı — ky)! (ha — ka)! °° 
(k) 
over these permutations, or equal to the determinant 
Pa eee O 
(h—n-+ 1)! ' (h— 1)! A!) 
O a a 
mera (h — n+ 2), > , h, L. 


The rows of this determinant consist, on reading from right to 
left, of polynomials in A of degrees 0, 1, +--+, (n — 1) with highest 
coefficient 1. The determinant is therefore , 


[hei ssh, L| 
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and we finally obtain the simple formula 


oa Lh, ho, ` my hy) 


‘nena 


§ Ty! fal hl . (16.13) 


n is to be taken at least as large as the number of rows in the 
pattern P(fi, fa ° °°); the reader should convince himself by 
direct calculation that the value of (16.13) remains unchanged 
on replacing n by n + 1. 

Frobenius’ rule for the character and this formula for the 
dimensionality are vastly superior to (14.7) for purposes of 
practical evaluation. 

As an example, we carry through the computations for the 
case of four electrons ; the results are given in the table below. 
The group m, contains twenty-four elements which are divided 
into five classes of conjugates ; each of these classes is designated 
in the second column of the table by the values (7,7, © - -) as- 
sociated with it. The first column contains the number of 
elements in each of these classes, and the sign + or — indicates 
whether the class consists of even or odd permutations. Each 
of the five remaining columns contains the values of a primitive 
character for the classes in whose row they stand. The symmetry 
pattern to which each of these characters belongs is indicated at 
the head of the column by the numbers fı, fy, - + - of elements in 
its rows, The first and the last of these columns may be filled in 
immediately, and the second and third with the aid of Frobenius’ 
rule. The fourth is then obtained from the second on noting 
that its symmetry pattern is the dual of that of 2; we need 
then merely to replace the values in the second column by their 
negative for the (-)-classes. Since patterns 2 and 3 contain 
but two rows we may take n = 2. Hence on writing x, y in 
place of £, ¢. we have merely to find the coefficients of x4y (for 
the column 31) and xy? (for the column 22) in the following 
polynomials : ) 


(x — y)(x + yy), 
x — y)(% + ya + y’) = (x + yla — y?) (x? + 9?) 
= (x + y) (xt — y’), 
(x — y)(x? + y?)?, 
(x — y(x + y) (a + y’) = (e — y’) + y’), 


(x — y) (st + y’). 


The dimensionalities of the five irreducible representations are 
contained in the first row ; they are 1, 3, 2, 3, 1. The verification 
of the orthogonality relations is left to the reader. 


al i a ma e - 
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211 1111 


§ 17. Calculation of Volume on 1 


Consider the line elements going out from the unit point I 
on the group manifold u, i.e. the infinitesimal unitary trans- 
formations dS = ||8sag||. We may take as the real components 


, „e l 
of this ‘‘ vector” the n quantities 7: ÒSax and the real and 


imaginary parts of the n(n — 1)/2 quantities dsag(a@<); the 
total number of components is thus n?, which is therefore the 
dimensionality of the group manifold u. Now in a linear algebra 
of this kind we may replace any two real quantities a, b by the 
complex quantities a+ ib, — æ + tb obtained from them by 
a simple linear substitution; we may therefore replace the 
real and imaginary parts of Ssug(a#<f) by Ssag itself and 
— S Sug = 8S gx. 

On transporting such an infinitesimal vector to the point 
S on the group manifold by a left-translation its terminus goes 
into the point S + dS = S(1 + 8S), dS = S-S; we must 
therefore consider the infinitesimal element 5S = S-1dS as the 
“vector” which leads from S to S+ dS. Our definition of 
volume on the group manifold [III, § 12] consisted in the 
following: the parallelepiped defined by n? vectors 5S leading 
from the fixed point S to the neighbouring points S + dS has 
as volume the absolute value of the determinant formed from the 
components of the n? vectors ôS. In accordance with the above 
remarks we may take as components of the vector 5S = ||Ssaql| 
the totality of coefficients Ssag themselves. 

Any S can be expressed in the form 


S = UEU- (17.1) 


where £ is a principal (diagonal) element of u and U is unitary. 
S is unchanged on multiplying U on the right by any principal 
element. We employ a geometrical terminology which will 
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allow us to visualize our procedure by means of an analogy. 
Two elements U, U’ of u which are right-equivalent with respect 
to the group of principal elements: U’ = UE, will be said to 
“lie on the same vertical [U]. From the n?-dimensional mani- 
fold u we obtain by projection the (n? — n)-dimensional mani- 
fold [u] of verticals [U] on considering all points of u which 
belong to the same vertical to be coincident. This process of 
identifying equivalent elements was described in general in the 
beginning of Chapter [lI—we had, in fact, already met it in J, 
§ 1, in the special case of projection in affine space. We may now 
consider U in (17.1) merely as a representative element of the 
vertical [U]; on allowing [U] to run through the entire mani- 
fold fu] and the angles w, of E: 


e(wy) 
e(we) 


e(wn) 


to vary independently over the complete range 0 S w < 27 
the element S defined by (17.1) describes the manifold u exactly 
n! times. 

The vector òU = U~1dU leads from the point U of the vertical 
[U] to the neighbouring point U + dU of the vertical [U + dU]. 
The totality of all points on [U + dU] which are in the neigh- 
bourhood of U is given by expressions of the form 


(U + dU)(1 + 8E) = U + (dU + U 8B) 


where 5£ is an arbitrary infinitesimal principal element with 
coefficients 7 òw, on the principal diagonal; the corresponding 
vectors are ŠU = U + $E. Since the terms in the, principal 
diagonal of òU are pure imaginary, E may be uniquely deter- 
mined in such a way that all terms in the principal diagonal of 
òU vanish; we call this transition from [U] to [U -+ dU] the 
“ horizontal transition from U.’’—The transition from some other 
point UE of the vertical [U] to the point (U + dU)E of [U + dU] 
is accomplished by means of the vector 


S'U = E-1-8U- E. (17.2) 


That this linear transformation (17.2) determined by E, which 
sends U into 8’U, is unimodular follows from our general re- 
marks concerning closed continuous groups—and can in this 
case be readily verified by direct computation. Naturally this 
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same equation holds for the horizontal transitions ŜU, S'U from 
U, UE respectively: _ 
YoU = E~. òU. E. (17.3) 


n? — n horizontal vectors U leading out from U determine an 
infinitesimal ‘‘ parallelogram ” whose content is measured by 
the absolute value of the determinant of the n? — n components 
Sttag (x = B) of the various vectors dU. On allowing each point 
U on the periphery of the parallelogram to describe the vertical 
[U] we obtain a tube whose horizontal sections are parallelo- 
grams; its projection on [u] is the original element of volume, 
the ‘‘ parallelogram ’’ defined by the 6U. Since the linear 
transformation (17.3), ŠU —> 8’U, is unimodular, the content of 
each horizontal section is the same, and may therefore be con- 
sidered as the content of the volume element on [u]. 

We now examine the variations in [U] and E in (17.1) when 
S goes over into S + dS. We have 


SU = UE 
and therefore 


dS*U+S-dU=dU-E+U-dE. 


On multiplying both sides of this equation by U-1$7! = H-1U~! 
we find 

U-1-§S-U+8U0=E87-80U-E+6E 
or 


S'S = U~. 8S- U = {E71 . ŠU - E — U} + 8E. (17.4) 


The components of the matrix contained in parentheses are 


€ 
Suina ( 2 — 1). 

We now define a parallelepiped at S which shall serve as a 
volume element in the following manner: n?— n of the n? 
sides 5S are obtained from (17.4) on allowing the angles of 
rotation to remain fixed, i.e. E = 0, and drawing n? — n hori- 
zontal vectors 8U from the point U to form a volume element 
of magnitude d[U] on [u]; the remaining n vectors ôS are then 
chosen in such a way that for each of them one and only one of 
the angles w, changes by dw, and [U] remains unchanged. The 
corresponding n? vectors 6’S define, in accordance with (17.4), 
an element of volume of magnitude 


J](È-— 1) + 4U]: deoy deog + ++ deg, (17.8) 


pp ?” 
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Since the linear transformation òS —> 6’S = U1- ôS» U is uni- 
modular this volume is equal to that of the element defined by 
the 5S themselves. Since € = l/e the product II in (17.5) can 
be written 


The final result is: The volume element described by S on allowing 
[U] in (17.1) to describe an infinitesimal volume element of mag- 
nitude d[U] on [u] and on allowing the angles of rotation w, to vary 
by dw, has the magnitude 


AB de, dw, © > + dw,-d{U}. (17.6) 


On integrating with respect to d[U] over [ut] we obtain the 
theorem, already applied in the preceding section, concerning 
the magnitude of that portion of u in which the angles of rotation 
have values lying between w, and w, + dw,. 

These considerations remain valid on restricting ourselves 
to the group u of unitary transformations with determinant 1. 
The angles of rotation are then subjected to the restriction 


wo, tH ot e Hon O, (17.7) 


and the only difference in the result is that the factor dw, in 
(17.6) is to be omitted. Condition (17.7) allows us to normalize 
the linear form hyw +: + - + hawn in the angles of rotation in 
such a way that k, = 0; the exponents (hy, ha * - *, ha) in the 
weights of the representations of u are then non-negative integers. 
It is desirable, however, not to impose this normalization 2, = 0 ; 
we need then only to remark that only the differences between 
the h; are of significance: the irreducible representations 
D(f, Ja ***, fa) of u are unchanged on increasing each of the f; 
by the same integer. In particular, these considerations justify 
the expression used in Chapter III for the volume on the group 
manifold of the unimodular unitary group Us, and the results 
of the preceding section constitute a direct proof, which is inde- 
pendent of the completeness theorem, of the fact that the 
representations of t, denoted by ©, constitute a complete set of 
inequivalent irreducible representations of U,. 
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§ 18. Branching Laws 


Finally, we show the usefulness of our formule for the 
characters by deriving two simple ‘‘ branching laws ” from them. 


l. Branching law for the Permutation Group. 


The irreducible representation of a; with the symmetry pattern 
Pifi, fe, + © *) reduces, on restricting ny to the sub-group ns of 
permutations of f—1 things, into the sum of those irreducible 
representations of Tts, associated with the patterns 


P( fi — 1, fa fs, n ); 
P( fo fo — 1, fatte); 


those patterns in which the rows are not arranged in decreasing 
length are to be omitted. Each such constituent appears exactly 
once. (In words, these patterns are obtained from the original 
one by removing a field in turn from the end of each row which 
is actually longer than the following one.) 

Proof. Let s be a permutation of the numbers 1, 2,° °°, 
f — 1 belonging to the class (2; — 1, 29, 73, + + +). Considered asa 
permutation of the f numbers 1, 2, +» -, f, s leaves the last number 
fixed; the number of one-term cycles is thus increased by 1 
and s, considered as an element of ~y, belongs to the class 
(ii 22, tg °° *'). In the expansion 


A. ght oft oe = Lay y,... E4184? oe (18.1) 


we have as the coefficients of those terms for which 
hy 2 h 2 


Wh, = 0 or Xfafar (s) (18.2) 


according as any of the signs < in the above inequalities is 


actually = or not. xy is the primitive character of 7,_, belong- 


ing to the symmetry pattern P(fi, fa +*+). On the other hand, 
the coefficient of ef e} -+ © [hy > ha>- +] in At ofo.. is 
equal to the character Xii: .(s) of the representation of ay 
with pattern Plh, fa °° *). Hence on multiplying (18.1) with 


Oy =€,+&+-:-->+€, we find 
Xft- ($) = Ahil, hig, ħa e + ap, hg—1, hz". + = 


Our branching law follows from this result and (18.2). The 
branching law leads to a recurrence formula for the dimension- 


alities g(ft, fo, °° *) 


-O naianei SER a 


BRANCHING LAWS 391 
2. Branching law for Cy. 


On restricting tn to the sub-group of linear transformations of 
an (n —1)-dimensional sub-space the irreducible representation 


(fy, fa °° *) Of Cn reduces into the sum of all those representations 
(fi, fa, © © *) Of Cua for which 
h Zh2h2fhr,2° — Z fa- Z fa; (18.3) 


each of these constituents appears exactly once. 

Proof. The linear transformations S of the sub-space ¢,_;: 
%_, = 0 are simply isomorphic to those linear transformations 
S of the variables x,, %2, °° *, Xa in which Xp —> Xn. Hence e, 
is to be replaced by 1 in the characteristic (16.9). The denom- 
inator is then 


D(€y, &2,° * En—1) © (€x — L(ee— 1) + * + (Ena — 2), 


as can be seen by subtracting the last column of D(e, &, © 5, 
&,-1, 1) from each of the previous ones and factoring the resulting 
(n — 1)-row determinant. In order to divide the determinant 
in the numerator by the factor (e, — l)(eg — 1) +--+ (€&,-3 — 1) 
we subtract the second column from the first, the third from the 
second, -- :, and finally the nt? from the (n — 1)§*. The last 
row then is 0, 0,°-:, 0, 1; the determinant is thus reduced to 
a determinant of order (n — 1). Now divide each element in 
the sth row by s, — 1 in accordance with 

gh, — għa 


ease 


e — 1 


— għ +. eos -t ehs, 


The result is that we then have in the numerator the determinant 
e*t + eee + e, ghal -+ e. |. ehs, . -| 
(e = Ei, Ea, sy En): 
But this is the sum of all (n — 1)-rowed determinants of the form 
le", e's, e. e s enal, 
h> h Z h> h 2h >c >h 2h, (18.4) 
On subtracting n — 1 from Ay, n — 2 from h, and hy, >, 0 
from hi,_, and hy, in order to obtain the numbers f [(16.7)], the 


inequalities (18.4) become the inequalities (18.3) and our theorem 
is proved. 
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Proof of an Inequality 
(Page 77.) 


In order to. prove the inequality stated on page 77 we must 
show that any continuous and differentiable function %, which 
is defined for all values of the real variable x, satisfies the 
condition 


W7fe Nt, “Taga 
a( 7v) s< [Apax ‘las - dx, (*) 


provided, of course, that the integrals involved actually exist. 
The Schwarz inequality 


a,b, ari as anbal? S (ad tt Andy) (byb; aes b Dn) 


employed in Chapter I becomes, on replacing the sums by in- 
tegrals—or rather each sum by two integrals— 


|ffigde + [hagda < (ffx + fefedx)(eridx + | erd.dx). 
Applying this inequality to 


d d d 
E + ah 
by taking 


d d 
J= fp = xb, n=% n= a4 


and transforming the integral 
[eas into — [shar 


by partial integration over the range —oo, +00, we obtain the 

desired relation (*) provided the term xx, which is integrated 

out, approaches 0 as x -> + œ. That this is actually the case 

if the two integrals on the right of (*) converge can be seen by 

the following indirect proof. Let e be any pre-assigned positive 
393 
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constant and consider a positive value of x for which x | (x) |? >e 


0 
a dyp|? 
and which is so large that | => 


dx 
| EZ 


then tells us that for x Sx’ S&S x + = 


l , . 
dx S re The Schwarz inequality 


< (x’ — x) vE “de 


|e) — a) SEE, whence |y] 2 u] — gyE> ba 


The integral of x? | |2 over the range from x to x + = is then 


Hence it follows that conversely 


CE 


2 z 


imply the inequality 
x | (x) |? Se. 


APPENDIX 2 
A Composition Property of Group Characters 
(Page 169.) 
Tue fundamental property of the irreducible representation 
H : s - U(s) which is expressed in the equation 
U(st) = U(s)U(t) 


is paralleled by the relation 
x(s)x(d) = j Erxl"). (*) 


Proof. lf x, y are two elements of the algebra of the group, 
the second of which belongs to the central, and if 


x> X, yo Y in , 


then Y = zl The matrix associated with z = xy in is 


1x and its trace is gn : 
£ £ 


On setting 


we find 


s, t $f 


Since y(t) depends only on the class of conjugate elements to 
which ¢ belongs we may replace 


x(st) by z Ex(sr¥r) 


on the left-hand side of the previous equation. Then the co- 

efficient of x(s)y(t) on either side of the equation depends only 

on the class to which the element ¢ belongs, and since x(s) is an 
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arbitrary function, y(t) an arbitrary class function, the assertion 
(*).follows from the fact that the two coefficients must agree. 
We have omitted mention of this equation (*) in the text 
in order not to interrupt the systematic development of the 
theory of representations, which is completely described by the 
orthogonality relations and the completeness theorem. 


APPENDIX 3 


A Theorem Concerning Non-degenerate Anti- 
symmetric Bilinear Forms 


(Page 274.) 


WE consider the given non-degenerate anti-symmetric bi-linear 
form 

f 

L Cie Xi Vie (Cki = — Cir) 

ik=1 

as the “ anti-symmetric product” [ry] of the two vectors 
g = (Xi, Xa tt, xy) and Y = (y,). Let e, be any non-vanishing 
vector; then by assumption [ex] cannot vanish identically in 
r, and consequently a second vector e, can be found such that 
[e€] = 1. The simultaneous equations 


[e] = 0, [esx] = 0 


then have f — 2 linearly independent solutions €g, © ++, es. These 
vectors are furthermore such that no linear dependence can 
cxist between them and @¢,, ĉa for if 


r = Eye) + Eee. + Eae +> + > + Spey = 0, 


it follows on building the anti-symmetric products [et] = &, 
[ex] = — é& that €,=&=0. We may therefore choose 
1, Co, °° *, ey as a co-ordinate system, i.e. as a basis from which 
all vectors may be constructed. Let the anti-symmetric pro- 
duct be expressed in terms of the components é; , of x, Y in 
this new co-ordinate system by 


f 
[eo] = Lyin Eine 
ik = 1 


The manner in which the new fundamental vectors were deter- 
mined requires that of the coefficients yi = [e;¢, 


Yu = 0, Yiz = l; Yig = My, 7, Vir = O, 
Ya = —l, Yoo = 0; Yop = O° ts, Yor = Y. 
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In consequence of the anti-symmetry all yx, Yia with i = 3, -- +, f 


vanish, and the matrix of the y,,is completely reduced into the 
2-rowed square sub-matrix 


O 1l 
—1 0 
and an (f— 2)-dimensional anti-symmetric matrix. Mathe- 


matical induction with respect to the dimensionality f yields the 


desired theorem that f is necessarily even and that the original 
form can be transformed into 


(Ema — Em) + (Esa — Sans) +° * + (7/2 terms) 


by an appropriate linear transformation. 
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by M. Born cited in (*). For canonical transformations in quantum 
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long before the development of the new quantum mechanics and 
had derived equations (13.9), (13.10) : Phys. Zeits. 18, 121 (1917). 
The new quantum mechanics completes the derivation by obtaining 
the probability coefficient A, eq. (13.8), from the structure of the 
atom. 


(30) 109. V. Weisskopr and E. WIGNER, Zeits. f. Phys. 63, 54 (1930). 


CHAPTER III 


(1) 110. For the general foundations of the theory of groups and 
the development of the theory of finite groups see: W. BURNSIDE, 
Theory of Groups of Finite Order, 2nd ed. (Cambridge 191 l); 
G. A. MILLER, H. F. BLICHFELDT and L. E. DICKSON, Theory and 
Applications of Finite Groups (New York 1916); A. SPEISER, 
Theorie der Gruppen von endlicher Ordnung, 2nd ed. (Berlin 1927). 

(2) 112. Vergleichende Betrachtungen über neuere geometrische 
Forschungen (Erlangen 1872); Math. Ann. 43, 63 (1893); or 


F. KLEIN, Gesammelte mathematische Abhandlungen, Vol. I, 460 
(Berlin 1921). 
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(3) 120. Following the fundamental results of T. MoLIEN on th 
of hyper-complex numbers (Math. Ann. 44 and 42, 1898) the 
theory of representations of finite groups was developed princi- 
pally by G. FROBENIUS (Sitzungsber. Preuss. Akad. 1896-99). The 
most important general results were re-discovered by BuRNSIDE— 
cf. his book cited in (1) above. The method developed by I. Scuur 
Neue Begriindung der Theorie der Gruppencharaktere, Sitzungsber. 
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clarity. 

(4) 134. The development of § 6 follows E, NOETHER, Math i 
30, 641 (1929), in particular §§ 3 and 16. The uniqueness ne 
plete reduction rather than reduction follows in general W.- KRULL 
Math. Zeits. 28, 161 (1925); O. Scumipr, Math. Zeits. 29, 34 
£1928) ; R. BRAUER and I. Scuur, Sitzungsber. Preuss. Akad. 1930, 

5) 152. ScHUR'’S treatment of the theory o ions. ci . 

(5) (8) is based on this lemma. ry of representations, cited in 

(6) 153. W. BURNSIDE, Proc. Lond. Math. Soc. (2), 3, 430 (1905). 

(7) 156. G. FROBENIUS and I. Scmur, Sitzungsber. Preuss. Akad. 
1906, 209. 

(8) 161. The method of integration over the group manifold is due to 
A. HURWITZ, Gött. Nachr. 1897, 71, although it was applied by him 
to the theory of invariants rather than to the theory of groups. 
I. ScaurR first obtained the orthogonality properties of the 
characteristics of the continuous rotation group in this way and 
used them to prove the completeness of the system of known 
representations : Sitzungsber. Preuss. Akad. 1924, 189, 297, and 

(9) 166. For a modern book on algebra see L. E. Dickson, Algebras 
and their Arithmetics (Chicago 1923); the German edition, 
Algebren und ihre Zahlentheorie (trans. by J. J. BURCKHARDT and 
E. SCHUBARTH, Zurich 1927), follows an author’s revision which has 
not appeared in English. Also B. L. VAN DER WAERDEN, Moderne 
Algebra II (Berlin 1931). An algebra was previously called a 
“system of hyper-complex numbers,” and is at present. to some 
extent in the German literature; the algebra of a group is there 
referred to as a ‘‘ Gruppenring.’’ The usual procedure in modern 
algebra consists in reducing the algebra into simple matric 

algebras, in which case the theorems on realization by linear trans- 
formations appear as corollaries ; this development will be followed 
in Chap. V. 

(10) 178. See R. WEITZENBOCK, Invariantentheorie (Groningen 1923). 
The foundation for the proof of the fundamental theorem of the 
theory of invariants is the HILBERT basis theorem : D. HILBERT, 
Math. Ann, 36, 473 (1890). The author has shown (Math. Zeits. 
24, 392, 1926) that the fundamental theorem is valid for any closed 
and for any semi-simple continuous group. The older theory of 
invariants was almost exclusively concerned with the group tn 
of all linear transformations with unit determinant. A really 
modern. book on the theory of invariants is lacking. 

(11) 175. The theory has been presented by S. Lre himself, with the 
assistance of F. ENGEL, in a huge three-volume work : ‘Theorie der 
Transformationsgruppen (Leipsic 1893, 1930). See also S. LIE, 
Vorlesungen iiber kontinuierliche Gruppen, ed. by G. SCHEFFERS 
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Rev. 85, 569; 36, 383 (1930). W. HEITLER and G. Rumer, Zeits. 
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OPERATIONAL SYMBOLS 


The number refers to the page on which the symbol is defined 


with .. . is associated . . . 110, 114. 

is contained in 290. 

conjugate complex of x 15. 

transposition: for operators 13, symmetry quantities 352, 
symmetry patterns 361. 

Hermitian conjugate: for operators 17, elements of an 
algebra 167. 

a(s) = als) 296. 

contragredient matrix 123, representation 123. 

equivalent as correspondences of the ray field 21. 

transforms as 145. 

scalar product 16, 32. 

vector product (in 3-dimensional space) 27; commutator 


[HA] = =(HA — AH) 264. 


temporal mean value 88. 

for vectors 90, vector spaces 90, correspondences 91, 
representations 126, groups 127, algebras and their 
elements 333. 

multiplication of representations of two groups 127. 

addition of representations 113. 

transition from p to $ 287. 

transition from $ to p 290. 
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LETTERS HAVING A FIXED SIGNIFICANCE 


The number refers to the page on which the quantity is defined 
LATIN 


c velocity of light; a Young symmetry operator 359. 

e primitive idempotent element (generating unit 291); — e- 
charge of the electron. 

— ere. 

(E, E,, E,) = © electric field strength 99. 

E, energy level 44. 

f — number of electrons, order of a tensor 139, 281. 

fa  -vector potential multiplied by e/ch 214. 

fag curl of fa (= ale Ri 216. 

F action of the electro-magnetic field 215. 

F(i tg, . . ., iy) tensor 139, 281. 

g dimensionality of a group representation 120, Landé g- 
factor 204, 207. 

h  Planck’s quantum of action divided by 2r 51, order of a 
finite group 118. 

H energy 51. 

(H,, Hy, H,) = Q magnetic field strength 99. 

I signature 188. 

Jj, F inner quantum number 189, 190. 

Fa total energy-momentum vector 220. 

k auxiliary quantum number 228. 

l, L azimuthal quantum number 64, 185, 194—for s, p, d, f, g, 

. terms l = 0, 1, 2, 3, 4, 

(Le, Ly, Le) = & orbital moment of momentum 63. 

m magnetic quantum number 64, 193, multiplicity of a re- 
presentation 321, 350; (= p) mass of the electron. 

m, = mcjh. | 

M, M’ action of the material field 211. 

(Ma, My, M,) = M total moment of momentum 179, 187. 

n dimensionality of a vector space 1; principal quantum 
number 69. 
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LETTERS HAVING A FIXED SIGNIFICANCE 411 


LATIN 

b, q canonically conjugate variables 94, a permutation in the 
rows, columns of a symmetry pattern 359, 

(Px, Pv, Pz) = p linear momentum of a particle 51. 

P symmetry pattern 358. 

(dæ, Iv; Gz) = q electric dipole moment 83. 

r distance from centre. 

S element of a group ; spin quantum number 206. 

(Sæ Sy, S2) = 3 electric current density 218, s* charge-current 
4-vector 214. 


(Sa, Sy, S,) = & spin 178, 203. 
Wy 0 0 1l 0 —1 l 0 
So = , $1 = , S = S a = 148. 
l 1 1 0 — l 


Ps energy-momentum tensor 218. 

T interchange of yy, pa and py’, Pa 149. 

v — valence 369. 

W perturbation energy 86, total action 216. 

Xo X14 Xa Xz ort x y co-ordinates of space time (t = % 98, or ct = %% 
l 


GERMAN. (For -dimensional vectors see their components 
under Latin letters.) 
¢ = ¢, group of (unimodular) linear transformations in n dimen- 
sions 128. 
(c)f representation of ¢ whose substratum is the tensors of 
order f 125. 
C, = D,(v = 27) representation of vth degree of c, or u ~ D; 
D, orthogonal group in n dimensions 142; D, same but in- 
cluding improper rotations 143. 
D™ 1-dimensional representation of rotation group D, 141. 
eu êa . . ., &, co-ordinate system in vector space 2. 
unitary representation of the rotation group induced in 
the function space of W(x y z) 143. 
abstract group 114 
conjugation 118. 
mean value 158. 
representation of the rotation group induced in system 
space 187. 
p, P invariant sub-space of r, R/ respectively 287, 282. 


Byte 


rt 


an algebra considered as a vector space 286, ty = t = ' R 
290, 350. 
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GERMAN 


R vector space |, Rf corresponding space of tensors of order f, 
[Ri] space of the symmetric tensors, {R} space of the 
anti-symmetric tensors, 239, 242. 

Ri, Ra system space of clectron translation, spin 196. 

ta  left-translation 116. 

u = u, (unimodular) unitary group in n dimensions 139, 

V ray representation giving rise to algebra of complex 
quaternions 182. 

t vector in » dimensional vector space 1. 


GREEK 


a == e*/ch fine structure constant 216, 
dy, Kronecker symbol = 1 or 0 according asi = k or îi + 17, 
+a 


5(x) Dirac 8-function (= Ọ except for x = 0 and )8(x) dx == 1) 
255. 


5, == + 1 according as s is an even or an odd permutation 121. 
ò signature 201. 


Pit 
= Hii y? a 


y = X Se 55 va 212. 


E€ Š nerating clement of a right- and left-invariant sub-space 
811. 

6, polar co-ordinates 60. 

p(== m) mass of the electron. 

v frequency 50. 


e 


o= 5 Larmor factor—unit of Zeeman separation. 

T = is Mf symmetric group of permutations of f objects 121. 
p electric charge density 218, an algebra 304. 

a electro-magnetic 4-vector potential 98. 

p vector defining the state of the material field 49. 

xy, X group characteristics, 150, 151. 

w angle of rotation 151. 


Laplace’s operator « 52. 


INDEX 


The numbers refer to pages of the text, those in boldface to the pages where the 
concepts introduced in boldface are defined 


Abelian group 118, its unitary irreduc- 
ible representations 140, in ray space 
182, quantum kinematics as A. g 
of rotations 272 ff. A. system of 


forms 25. 
Absorption of photon 44, quantum 
theory of a. 107, 224, 261, a. lines 45. 


Action of material field 211, of electro- 
magnetic field 215, total 216, 222, 


Adaptation of co-ordinate system to 
sub-space 3. 


Addition of vectors I, 
ences 6, of matrices 7, of repre- 
sentations 126, of elements of an 
algebra 165, 303, of numbers of a 
field 302, direct sum of algebras 311, 


of correspond- 


Affine correspondence 5, see Corre- 
spondence, linear; a. geometry 1 ff., 
112. 


Algebra, general concept 303, of group 
66, 181, 286, simple 311, 313, 
semi-simple 316, order of a. 304, 


modulus or principal unit 168, 304, 
basal units 168, 304, division a. 
(= field) 304, 316, central of a. 167, 
311, invariant sub-a. 167, 280, 
generating unit of s.-a, 168, 291, 
irect sum $11, direct product 333, 
reduction into simple matric a. 167, 
309 ff., 315 ; — representation of a. 166, 
304 ff., regular representation 289, 
complete reduction of representation 
306; — a. of complex quaternions 182, 
of linear transformations 307, of 
symmetric transformations 282, 332, 
its enveloping a. 284, reduction of 
a. of linear transformations 307 ff. 


Alkali spectrum 85, 86, 202, doublets 
in 204, with anomalous Zeeman 
effect 205. 


Alkaline earth spectrum 207, 246. 
Alternation 358. 


Atom, Rutherford’s model xiii, 


Alternation law 207, 370. 


Bohr’s 
theory of a. 43, radiation on classical 
and Bohr theories 44, on quantum 
theory 104 ff., 256 ff., Hund’s vector 
model of a. 191, 244; see Spectrum. 


Automorphism 115, automorphic corre- 
spondence of group 134, 


Auxiliary quantum number, see under 
Quantum number. 


Azimuthal quantum number, see under 
Quantum number, 


Balmer 45. 


Bessel’s inequality 33, 
representations 169. 


for system of 
Black body radiation 41, 104, 256. 

Bohr, H. 39. 

Bohr magneton 66, 205. 

Bohr, N. xil, 43, 95, 105, 236, 245. 
Boltzmann 108, 

Born 48, 74. 

Bose 50. 

Bounded Hermitian form 39. 

Brackett 46. 


Branching rule, for spectra 207, for 
linear and permutation groups 390 ff. 


de Broglie, L. 48, 53, 211, 220. 
Burnside’s theorem 153. 


Canonical variable 52, 94, c. trans- 
formation 96, in quantum mechanics 
98, c. aggregate 79, c. basis for 
rotations in ray space 274. 


Central, of group 118, of algebra 167, 
313. 
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414 INDEX 


Character, group or group character- 
istic 150, 327, 395, of unitary re- 
presentation 156, primitive c. 150, 
150, behaviour on addition and 
multiplication 151, orthogonality pro- 
perties 156, 159 f., 317. For char- 
acters of special groups see under 
qualifying adjective. 


Character of element of algebra 295. 


Characteristic number of Hermitian form 
or operator 21, 35, of unitary form 26, 
multiplicity of c. n. 22, 26, of energy 
56, 80 ; — characteristic vector or func- 
tion 21, 35, of wave equation 56, 80; 
— c. spacé 22, of energy 80, 192, of 
moment of momentum 189, 192. 


Class of conjugate elements 118, in 
symmetric permutation group 328; 
— c. function 150, 156, as element in 
central of group algebra 169. 


Classical mechanics compared with 
quantum mechanics xiii, 73, 81, 94, 
190, ‘‘c.” combination principle 47, 
82. 

Clebsch-Gordan series 128, 163, 190, 
371, as quantum rule for composi- 
tion of moment of momentum 190, 
as valence rule 371. 


Closed shell 86, 245. 
Cogredient transformation 5. 
Collision phenomena 46, 70 ff. 


Combination principle, Ritz-Rydberg 
44, 48, 82, “classical ” 47, 82. 


Commutation rules, Heisenberg’s 94, 
274, interpretation of 275, wave 
equation derived from c. r. 277 f., 
c. r. for infinitesimal rotations 178, 
for moment of momentum 179, for 
spin 227, in second quantization 249, 
for Maxwell-Dirac equations 254 ff. 


Commutative field 302, c. group 118, 
c. operators transformed simultane- 
ously to principal axes 25. 


Commutator 177, 264, 267. 
Commutator form 273. 


Completeness of unitary-orthogonal sys- 
tem of functions 8, of spherical 
harmonics 62, on group manifold 
170, c. of system of unitary repre- 
sentations 140, 159, 170, 305, 318, 
of product representation 164 ;— com- 
plete system of orthogonal vectors in 
3-space 257. 


Complete reduction of correspondences 


or representation 9, 122, sometimes | 


equivalent to reduction 18, 123, 136, 


292, 301, 306, 308, of product re- 

presentation 140, of ©; x @, 128, 

190, uniqueness 136, 156, c. r. of 
system space with respect to ener 
80, of representation induced in 
system space by Ð, 188, of group 
space 294, of tensor space 301, of 
an algebra into simple matric algebras 
167, 309 ff., 315. 

Composition of physical systems 91, 
behaviour of energy on c. 92, 193, of 
moment of momentum 190, c. of 
equivalent individuals 239, 241, under 
Pauli exclusion principle 244, method 
of c. compared with second quantiza- 


tion 248; — c. of transformations 6, 
110, see Multiplication. 


Composition series, of sub-groups 132, 
of sub-spaces 122, 135. 


Compton effect 224. 
Condon 74. 


Congruent modulo sub-space 4. 


Conjugate of element of group 118, 


for permutation group 328, of ele- 
ment of algebra 167, 


Conjugation 118. 


Conservation law, for electricity 214 f., 
energy 82, 218, 220, momentum 218, 
220, moment of momentum 188, 


221, Dirac’s c. l. 227, of quantum 
field 264 ff. 


Contact transformations 96. 


Contragredient transformation 12, re- 
presentation 123. 


Contravariant vector 13. 
Convex region 79. 


Co-ordinate system, in vector space 2, 

adapted to sub-space 3, transforma- 
tion of c. s. 4, normal c. s. 16, 
21, Heisenberg’s c. s. 80, in special 
relativity 147, in general relativity 
219. 


Correspondence or transformation, 
general 110, identical 110, inverse 
III, product 111, isomorphic 112, 
automorphic 134, similarity 283; — 
linear 5f., 21, = projection 282, 
in function space 35, trace 11, 150, 
dual 13, 123, contragredient 12, 
Hermitian 18, unitary 16, infinites- 
imal unitary 28 ff., rotation of ray 
space 20, x multiplication go, re- 
uction and complete reduction 9, 
irreducible system of 1l. c. 122, 153 ff., 
symmetric c. in tensor space 282, 

or special groups of correspondences 
see under qualifying adjective. 


= 
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tion ofe. c. 214,217, e. dipole moment 
83, 104, 197. 7 


Electro-magnetic field, effect on charged 
particle 98, 213, 222, interaction with 
matter 105, 261, equations of 102, 
218, quantization 104, 253, action 
215. | 


Correspondence principle 95. 

Coupling, Russell-Saunders or (s/) 206, 
(77) 206. | 

Courant 40. 


Covariant linear quantity 173, in 
quantum mechanics 197; — c. vector 
13. 

Cycle of a permutation 328. 


Electron, de Broglie’s equation for e. 
53, Schrödingers 54, III, Dirac’s 
213, e. beams 50, spin 195, 196, 
203, 276, translation 96, in spher- 
ically symmetric field 63, 227, nega: 
tive energy levels and “‘ positive e.” 
225, existence vs. constitution of e. 
261, e. and proton 262. 


Element, of group 114, of group alge- 
bra 166, of algebra 303, idem- 
potent e. 168, 291, independent 292, 
primitive 293, real 295, trace 299, 
317, scalar product 299, character 
of an e. 295. 


Elsasser 74. 


Cyclic group 117; 


Davisson 50, 53) 70- 


- Decomposition, see Complete reduction, 
of space 3, 122, of dual space 14, 
in unitary geometry 18, into char- 
acteristic spaces, 22. 


Degenerate S stem 83, perturbation 
of 86, . accidental degeneracy 192. 


Degree of a representation 120. 
§-function 36, 255: 
Derivative of operator 94. 


Dimensionality of space 2, 3, of a 
representation 120. 


Dirac 109, 210, 211, 217, 225, 255, 260, 
262, 357- 


Dirac’s relativistically invariant equa- 
tions for electron 213, 218, 225, in 
central field 227 ff., quantization of 
253 ff.; — D. theory of proton 262. 


Emission, of photon 44, quantum 
theory of e. and absorption 107, 224, 
es , spontaneous 107, stimulated 
108, 


Energy, and its operator 51 ff., 80 ff., 
97, 187, 215, ©. level 44, 50, in 
collision phenomena 70, in perturba- 
tion theory 86 ff., on composition 92, 
in electro-magnetic field IOI, with 
spin 215, 220, €. of radiation field 
103, 258, e. of simple state 189, 191, 
of system of equivalent individuals 
320 ff., 356, of molecule 346, ex- 
Dispersion 53, 224. change e. 322, 342, 346, e. and 

EE m momentum 51, 210, 220, conserva- 
Division algebra (== field) 304, 316. tion 188, zero-point e. 104, 258, 261, 


Double tensor 347. inertia of e. 221, e. quantum 41. 


Directional quantization 67, 75, 205. 


Dual space 12, matrix 13, system of Enveloping algebra 284, for double ten- 
penton 123, symmetry ele-| sors 348. 
ment and representation 352, sym- : : 
metry pattern, 361, 369. Pane, axioms of 112. 
Dynamical variable, represented by Equivalence degeneracy 239 f., 320. 
Hermitian form 74, 275, Measure- 
ment of 74 ff., mean value or ex- 


ee 75, intensity on transition 


3, 197, composition 91, totality of | Equivalent systems of linear transforma- 
d.v. represented by irreducible system | “tions 121, e. representations 120, 
238; — d. law 54, 80 f., 97, 187, 266. sub-spaces 135, 283, e. points with 
respect to transformation 112, è. 
one with respect to sub-group 
118. : 
ae aa see under | muclidean geometry 15, 112. 


Einstein 42, 50. 


Equivalent individuals, state of system 
consisting of e. i. 239 ff., energy 241, 
320 ff., 356, quantization 246. 


Dynamically independent systems 92. 


Exchange energy 322, 342, 346. 


Expectation or mean value of physical 
quantity 75, 78, 92. 


Exponential function 28, of matrix, 29. 


Electric charge, atomicity of 216, posi- 
tive and negative 262, © © density 
and current density 215, conserva- 
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Factor group 119, 132. 
Faithful realization 114. 
Ferro-magnetism 347. 


Field equations, for electro-magnetic 
field 102, 218, for matter 213 f., 
their quantization 104 f., 253 ff. 

Field, number f. 294, 302, algebraically 
closed 294, commutative 302, finite 
f. of modulus ¢ 303; — ray f. 20, 
vector f. 20, point-f. 110. 


Fine structure, in hydrogen 203, 236 


f, s. constant 216. 


Form, linear 12, bi-limear 13, 16, 18, 
Hermitian 18, unitary 16, commu- 
tator 273, anti-symmetric bi-linear 
273, 397. 


Fourier coefficient 33, series 33, in- 
tegral 39, F. c. or group matrix for 
representation 165. 


Franck 46, 70, 74. 

Frequency 50, Bohr’s f. rule 47, 105, 
109. 

Frobenius 156, 358, 383. 


Function space 32, of quadratically 
integrable functions 143. 


Galois, 132. 
I-process 126. 
Gamow 74. 


Gauge invariance 100, 213, 220, rela- 
tion to conservation of electricity 214, 
217, role in quantization 256, 271. 


Generating function of infinitesimal 
canonical transformation 97. 


Generating unit 291, independent 292, 
in field of complex numbers 295, of 
symmetry class of tensors 296. 


Geometry, affine or vector I ff, 112, 
Euclidean 15, 112, unitary 15 ff., 
characterized by group 112. 


Gerlach 65, 75. 

Germer 50, 53, 70. 

g-factor, Landé, 204, 205, 207. 
Goudsmit 203. 


Group 110 ff., transformations g. 111, 
abstract 114ff., isomorphic 1165, 
automorphic correspondence of g. 


115, 184, commutative or Abelian 
118, cyclic 117, order of finite g. 
118, of element of g. 117, central 


INDEX 


index of sub-g, 
or invariant sub-g, 
119, simple 135, 


118, sub-g. 116, 

118, self-conjugate 
119, 132, factor g- 
direct product 127, closed continuous 
160 ff., Lie theo of continuous g. 
175 ff., g- manifold 160 ff, invariant 
sub-space of f; manifold 291; — realiz- 
ation of g. 114, representation of g 
120, of sub-g. 127: 334; of direct 
product 333, £- matrix 165, algebra 
of g. 166, 151, 280. , For special 
groups, see under qualifying adjectives, 


Gurney 74. 
Gyro-magnetic effect 205. 


Hallwachs 42. 
Hamilton 50, 138. 


Hamiltonian equations, in classical 
mechanics 96, 98, in quantum mech- 
anics 94, in quantum held theory 253. 


Heisenberg xiii, 48, 80, 82, 222, 264, 
347. 
Heisenberg’s co-ordinate system 80, 


Heisenberg- Pauli theory of the quantum 
field 253 ff. 


Heitler 342. 
Hellinger 39, 40. 
Hermite 18. 


Hermitian form or operator 18, non- 
degenerate 18, positive definite 18, 
unit 15, idempotent 23, in function 
space 35, 37, bounded 39, product 
of H. f. 20,- trace 20, characteristic 
number 21%, 3 Fi transformation to 
principal axes for single F, f. 21 ff., 
32, for Abelian system 25; —- H. f. 
represents physical quantity 74, 275, 
characteristizes statistical aggregate 
79, 239; — H. conjugate 17. 


Hermitian polynomials 57 ff. 
Hertz, G. 46, 70, 74. 

Hertz, H. 42. 

Hilbert 39. 

Hilbert space 32. 


Hund’s vector model of the atom 191, 
244. 


Hydrogen atom 45, on Schrödingers 
theory 63 f., on Dirac’s theory 234 ff., 
spectrum 45, 69, fine structure 203, 
236. 


INDEX 


Idempotent Hermitian form. 23, 37, 
independent 23; — i. element of an 
algebra 168, 291, independent 292, 
primitive 293. 


Identity correspondence 6, 110, repre- 
sentation 121. 


Independent, linearly i. vectors 2, i. 
idempotent forms 23, idempotent 
elements of algebra 292. | 

Index of sub-group 118. 


Infinitesimal unitary transformation 28 ff., 
rotation 27 ff., moment of momentum 
induced by i. r. 178, canonical trans- 
formation 96, element of continuous 
group 160, 177. 


Inner quantum number, see under 


Quantum number. 


Intensity, as measure of probability 49, 
i. of dynamical variable on transition 
83, 197, of spectral lines 44, 83, 232, 
in anomalous Zeeman effect 201. 


Interaction between matter and radia- 
tion 104 ff., 261. 


Interchange, of right and left 225, of 


past and future 109, 227, 263. 


Invariance, in special relativity, dif- 
ficulty for quantum mechanics 54, 
Dirac’s treatment 210 ff, i. of 
quantum field equations 268 ff.; — in 
sense of general relativity 219, under 
change of gauge 100, see Gauge 
invariance. 


Invariant of transformation group 117, 
70, in representation space 171, 
classical theory 170 ff. 


Invariant sub-space 8, under system 
of transformations 122, 135, 282, 
left-i. s.-s. in group space 2809 ff., left- 
and right-i. s.-s. 168, 311, in tensor 
space 296 ff., significance in quantum 
theory 320; —i. sub-group 119, 
maximal 132. 


Inverse correspondence 6, 111, element 


of group 114. 
Involution 13. 
Tonization potential 46, 


Irreducible invariant sub-space 122, 282, 
system of linear transformations, re- 
presentation 122, reduction into i. 
constituents 122, 135 ; — irreducibility 
= complete irreducibility in unitary 
domain 136, 292, 301, for reducible 
algebra 305. for algebra of trans- 
formations in completely reducible 
vector space, 307. 
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Isomorphic correspondences II2 
simply isomorphic groups 115. 


Jeans 42, 102, 103. 

(77) coupling 206. 
Jordan-Hélder theorem 131 ff. 
Jordan, P. 261, 280. 


Kinematically independent systems 92, 
190, perturbation of 93. 


Kinematics of system determines repre- 
sentation in: system space 189, 
Heisenberg’s quantum k. 94 ff., as 
Abelian group of rotations 272 ff., 
in second quantization 250, k. of 
spin 195, 203, 276. 

Klein’s Erlanger programme xv, I12. 


Laguerre polynomials 7o. 
Landé, 204, 208. 
Laporte’s rule 201, 203. 


Legendre polynomials and associated 
functions 62, with spin 230. 


Lenard 42. 

Leonardo da Vinci 112. 

Lie 176. 

Light, wave and corpuscular nature of 


48 f, 53. 


Linear, I. algebra 303, see Algebra ; — 
l. correspondence 5, see under Corre- 
spondence; — ]. form 12, 1. covariant 
quantity 173, l. projection = 1. cor- 
respondence 282, |. sub-space 2; — 
l. momentum, see Momentum, linear. 


Linear group, complete c,, 123, simplest 
representations 123 ff., representa- 
tion Gy, of cg 128 ff, its ir- 
reducibility 299, representation G@y,, 
131, 164 ;— reduction of (c)/ equivalent 
to reduction of algebra of symmetric 
transformations nsa f., unitary re- 
striction immaterial 285, result of 
the reduction 301, characteristics 
335 ff., relation to characters of 
symmetric permutation group 326- 
representations of order f 309, 
branching law 391. 


London 342, 346, 370. 


Lorentz group, restricted, obtained from 
c, 147 ff., complete L. g. obtained on 
adding reflection 147, positive and 
negative transformations 147, and 
Dirac’s equations 212 ff., transforma- 
tion induced in system space 268 ff. 


Lyman 45. 
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Magnetic quantum number, see under 
Quantum number. 

Magneto-mechanical anomaly 205. 

Magneton, Bohr 66, 205. 

Magnitude, absolute, of vector 16, 19. 


Mapping 11o, 
Transformation. 


see Correspondence, 


Matric algebra,:simple 168, 313. 


Matrix 7, dual or transposed 13, unit 
6, addition 7, multiplication 8, re- 
duced and completely reduced: 9, 
transformation of m. 8, norm II, 
trace 11 ;—gronp m. 165. 


Maxwell’s equations 102, 218, quan- 
tization of 104 ff., 253, M. action 215, 


Mean value or expectation of physical 
quantity in pure state 75, 78, 92, in 
mixed case 79; — m. V. over group 
manifold 158. 


Measurement of dynamical variable 74 ff 
Metric 15. 

Millikan 42, 245. 

Minkowski, H. 79. 

Mixed state 79. 


Modulus, of algebra 168, 304, reduc- 
tion of 168, 301; — of finite field 303, 


Molecule, spectrum 191, perturbation 
theory constitution 339 ff., non- 
olar bond 342, London formula 
or binding energy 346, on taking 
account of Coulomb forces 356, val- 
ence theory 369 ff. 


Moment of momentum of a representa- 
tion 179, of D; 179 ; — m. of m. of phy- 
sical system 187, orbital 64, 195, 
spin 195, 203, 218, behaviour on 
composition 190, conservation 188, 
219 ff., 227, reduction of system 
space with respect to m. of m. 192, 
induced by infinitesimal rotations of 
Lorentz transformations 185, 269. 


Momentum, lincar, and its operator 51, 


220, conservation of energy and m. 
218, 264 ff. 


Moseley’s law 69. 


Motions, geometrical 111, group of 176. 
Multiplet 196, 206, 373, as relativis- 


tic phenomenon 204, 234, normal 
Zeeman effect 101, 193, 198, anom- 
alous Zeeman effect 204,. 208 ff., 
alkali doublets 204, singlets and 
triplets in alkaline earths 207, 246, 


INDEX 


multiplicity 321, 350, under Pauli 
exclusion principle 352, in 2-dimen- 
sional spin 35 p 369, multiplicity and 
valence 369 ff., branching rule and 
alternation law 207, 370. 


Multiplication, of vector by number 1, 
of correspondences and matrices 6 ff., 
of numbers of field 302, of elements 
of algebra 165, 303, quaternion m. 
138, outer or X-m. of spaces, vectors, 
operators 90, 125, of representations 
126, direct product of groups 127, 
333, of algebras 333, X-m. of repre- 
sentations 127 ; — scalar m. of vectors 
16, of elements of an algebra 299, 317. 


v. Neumann 40, 78. 
Noether, E. 134. 


Normal co-ordinate system 16, in rel- 


ativity 147, n. state of atom 45, 
n. term order 206. 
Number, of field 302, operations on 302; 


— characteristics n. 21. 


Operator = linear correspondence ©, 
Hermitian. 18, in function space 35, 
representing dynamical variable 3 5, 
considered as function of time 31, 
derivative of o. 94. 

Orbit, in older quantum theory 47, 
orbital moment of momentum 64, 
195. 

Order, of finite group 118, of element 


of group 117, of sub-group 118 
of finite algebra 303. 
Orthogonal group, see Rotation group ; 


— o. transformation 16, o. vectors 16. 


Orthogonality relations 32, for group 
characters 159 ff,, 317, for sym- 
metric permutation group 367. 

Oscillator 43, 56 ff., 84, black bod 
radiation as system of o. 102 ff., 258, 
quantum mechanical laws of system 
of o. 249. 


Parseval’s equation 33, 35, 162. 
Paschen 45, 236. 
Paschen-Back effect 208. 


Pattern, symmetry, 
pattern. 


Pauli 77, 203, 211, 244, 264, 347, 351. 


Pauli exclusion principle 207, 244 ff., 
and reduction of algebra of sym- 
metric transformations 281, 323, 347 ff., 
355, 370 ff. | 


see Symmetry 


INDEX 


Peirce reduction 312. 


Periodic system of the elements 69, 
242 ff. 

Permutation II, 
328, conjugate 328, 
tensor 281. 


reduction into cycles 
as operator on 


Permutation group, symmetric 121, 
classes 328, elements as symmetry 
- operators 286, relation to symmetry 
class of tensors 286 ff., for arbitrary 
p. g. 332, characters 320, 383 ff., 
relation to characteristics of unitary 
group 331, use of characters to 
calculate exchange energies 322 ff., 
energy of non-polar bond 346, ex- 
plicit theory of representations 358 ff., 


reciprocity theorems 339, branching 
law 390. 
Perturbation theory 86 ff., for kine- 


matically independent systems 93, 
for equivalent individuals 321 ff., 
for molecules 339 ff.; — p. energy 86, 
for axially symmetric field 192, for 
magnetic field 10I, 193, 204, 224, 
for electric field 101, 224, spin p. 196, 


in Dirac theory 224, determines 
transition probability 89. 
Pfund 46. 


Photo-electric effect 42. 

Photon 42, 49, 54, 104, 248, 258, 261. 
Planck xiii, 41. 

Planck’s radiation law 41, 108. 
Point-field rro. 


Polynomial, characteristic 11, 22 ; — Her- 


mitian 57 f., Legendre 62, with 
spin 230, Laguerre 70. 
Primitive unit 293, character 150, 


symmetry class 358. 


Principal unit of algebra 168, 304; 
— p. transformation 128, transforma- 
tion of Hermitian forms to p. axes 21, 
25, 32, 39, for unitary forms 26, 39; 
— p. quantum number, see under 
Quantum number. 


Probability, relation to intensity 49, 
that a dynamical variable assume a 
given value in a pure state 75, ina 
mixed state 79, p.density and current 
density 50, 215, 217 ; — transition p. 73, 
83, 89, in composite system 90, 93, 
for an atom in radiation field 106 ff. 


Product, see Multiplication. 


Projection, with respect to sub-space 4, 
in unitary geometry 18, orthogonal 
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and unitary-orthogonal 23, linear 


p. = linear correspondence 282. 
Proton, Dirac’s theory of 262. 
Pure state 75, conditions for 77. 


Quantization, in the older quantum 
theory 47, in Schrédinger’s theory 
51, 56, in Heisenberg’s 93 ff., of 


composite system 89, of electro- 
magnetic field 104, 253, second 246, 
of Maxwell-Dirac field equations 


253 ff. ; — directional or space q. 67,75, 
205. 


Quantum, of action 41, 51, of energy 41. 


Quantum kinematics, Heisenberg’s 94 f., 
as Abelian group of rotations 272 ff., 
in second quantization 250. 


Quantum mechanics, general scheme 
74 ff., dynamical law 54, 80, 97, 187, 
266, composition 91, Heisenberg’s 
formulation 93, Schrödingers equa- 
tion 54, 101, Dirac’s equations 213, 
218, Heisenberg-Pauli q. m. of wave 
fields 253 ff. 


Quantum number, auxiliary (2) 228, 
selection rules 233, relation to azi- 
muthal and inner q. n. 228, 233; — 
azimuthal q. n. (Z, i) 64 ff., 142, 196, 
determines orbital moment of mo- 
mentum 65, 196, selection rules 84, 
201, on composition 194, 207, 373, 
relation to auxiliary q. n. 228, 233; 
— inner q. n. (7,7) 189, 196, deter- 
mines total moment of momentum 
179, 189, behaviour on composi- 
tion 190, 194, 206, selection rules 
198, relation to auxiliary q. n. 228, 233; 
—— magnetic (m) 64, 193, determines 
z-component of moment of momentum 
65, 180, 189, selection rules 85, 198, 
of spin and of orbital moment of 
momentum 209, in Dirac’s theory 
232; — principal or total (z) in hydro- 

en 69, in hydrogen-like spectra 85, 
as no group-theoretic significance 
144, true 86, 243, effective 243; — 
radial 64, 144; — spin (s) 206, re- 
lation to valence 369. 


Quantum state 43, 56, 80, 188, simple 


Quaternion 138, complex 182. 


Radial quantum number, see under 
Quantum number. 


Radiation, from atom 44, 83 ff., 105 f., 
224, field 102 ff., 215, 256 f., black 
body 41, 10.4. 
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Ray 4, 20, 
system 75, 
r. field 273, 


Rayleigh 42. 
Real element of algebra 167, 
ating unit 295. 


Realization of group 114, faithful 114, 
contracted 118, 119, of algebra 166; 
— linearr. = representation 120, see 
Representation. 


represents state of physical 
r. field 20, rotations of 
r. representation 181 ff. 


gener- 


Reciprocity theorem, for arbitrary group 
338, for permutation group 339. 


Reduction of correspondences or re- 
presentation 9, 122, uniqueness 136, 
156, complete r. 9, 122, 135 (see 
Complete reduction), sometimes im- 
plies complete r. 18, 123, 136, 292, 301, 
306, 308, of regular representation 
289 ff., 305 ff., of system space of 
equivalent individuals 238 ff., anti- 
symmetric r. for electrons 242, 351 ff., 
symmetric r. for photons 248, 351 fi., 
influence on term spectrum 241, 372 ff., 
general treatment without spin 296 ff., 
with spin 347 ff., for symmetric and 
anti-symmetric cases 351 ff 


Reflection, signature induced by r. 143, 
144, 188. 


Regular representation 289, 
305 ff. 


Relativity theory, special 51, 98 ff., 146 ff., 
of quantum mechanics 210 ff., of 
wave fields 268 f., r. and spin 204, 217, 
222 ff.,; — general 219. 


Representation, of finite group 120, 
of continuous group 160 ff., by ro- 
tations of ray space 181, degree or 
dimensionality 120, character 150, 
complete reduction 122, irreducible 
122, uniqueness of reduction 136, 
156, criterion for irreducibility 159, 
identical 121, equivalent 121, unit- 
ary 136 ff., any r. equivalent to unitary 
r. 157 ;— formal processes: addition 
126, X-multiplication 126, 127, X- 
multiplication 127, I-process 126, 
r. of sub-group 127 ; — of algebra 166, 
304 ff., regular 289 ;— general theory: 
orthogonality properties 157 ff., 317, 
in terms of group algebra 165 ff., 
completeness of system of r. 159, 170, 
318, proved by reduction of regular 
r. 305 ff. For r. of special groups, 
see under qualifying adjective. 

Resonance, between states of same energy 


87, between equivalent individuals 
239 ff., 320. 


reduction 


INDEX 


Resonance line 45. 


Ritz-Rydberg combination principle 44, 
48, 82. 


Röntgen 43. 


Rotation group, in 2-space and its re- 
presentations 140 ff., orthogonality 
of characters 162; in 3-space 
and its representations 142 f., rela- 
tion to unitary group in 2-space 144, 
augmentation by improper rotations 
143, orthogonality of characteristics 
163, completeness 143, 163, 180, 184, 
389, generated by infinitesimal ele~. 
ments 175, representation induced in 
system space 185, 195,372; — in 
m-space 184. 


Rotation in ray space 21, 181, 273, 
representation by r. of ray field 180, 
quantum kinematics as Abelian group 
of r. 272 ff. 


Rupp 50. 

Russell-Saunders coupling 206. 
Rutherford xiii, 74. 

Rydberg number xiii, 45, 69. 


Scalar product, see Multiplication. 


Scalar quantity, commutes with moment 
of momentum and signature 188, 
selection rules 197. 


Schrédinger 48, 50, 56, 102, 187, 216, 
220, 258. 


Schrédinger’s equation 54 ff., relativ- 
istic 101, for system of equivalent 
particles 194, as limiting case of 
Dirac’s 234, derived from com- 
mutation rules 277 ff. 


Schur, I. 152. 
Schwarz’ inequality 30, 393. 


Second quantization 246, see under 
Quantization. 


Secular equation I1, 21, 26, 
theory 88, 209, 344. 


Selection rules 44, 84, 85, for oscillator 
84, for electron without spin 84 ff., 
with spin 232, for scalar quantity 197, 
for vector quantity 197, for auxiliary 
quantum number 233, azimuthal 84, 
201, inner 198, magnetic 85, 198, 
for signature 201. 


in quantum 


Self-conjugate sub-group 119, maxima 
132. 


Semi-simple algebra 316, 


INDEX 


Separation of terms by perturbation 87, 
321, axially symmetric erturbation 
193, in normal Zeeman effect 101, 193, 
198, in anomalous Zeeman effect 204, 
208 ff 


Series, in hydrogen 45, 69, in alkalies 
85, 202. 


Series of composition, see Composition 
series. 


Signature, of representation 143, 
dynamical variable 188, 203, 
tion rule 201. 


selec- 


Simple algebra 311, 313, 
state 189. 


(s?) coupling 206. 


group 132, 


Smekal-Raman effect 224. 
Sommerfeld 193, 236. 


Space, affine, linear, vector t ff., linear 


sub-s. 2, dual 12, unitary 15 ff, 
Hilbert or function 32, 143, reduction 
or decomposition 20, 22, composition 


series 122, 135, product 90, tensor 
125, 281 ff., group s. 115, 160, re- 
presentation 120, 171 ff., algebra as 


vector s. 286, 305, system, see System 


space, 
Space quantization 67, 75, 205. 
Span, space spanned by vectors 3, 20. 


Spectrum, atomic, line s. reduced to 


term s. 44, of hydrogen and 1-electron 


ions 45, in Schrédinger’s theory 69, 
in Dirac’s theory 234, ofalkalies 85 ff., 
doublets 204, of alkaline earths 207, 
246, 3-electron 374, 
periodic table 206 ff., 242 ; — general 
theory, without spin 194, with spin 
206 ff., application of Pauli ex- 
clusion principle 242 ff., group- 
theoretic classification 369 ff., re- 
duction into term classes 283 ff., 320 ff., 
calculation of term values 320 ff.; 
— molecular 191 ; — of characteristic 
numbers 36. 


Spherical harmonics 60 ff., 84, as basis 
of unitary representation in function 
space 142, with spin 230 ff. 


Spin, electron 195, 196, 203, as relativ- 
istic phenomenon 204, 217, 222 ff, 
s. moment of momentum 195, 221, 
magnetic effect 204, 224, s. and 
valence 369 ff.; — s. perturbation 196, 
203, in Dirac’s theory 222 ff.; — s. 
quantum number, see under Quantum 
number. 


Stark effect, linear 102. 


as 


of elements of 
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State of a physical system, represented 
by vector or ray in system space 54, 
74 ff, pure 75, 78, mixed 79, of 
total system under-determined 92 ; — 
quantum or stationary 43, 56, 80, 183, 
simple 189. 


Stationary state, see under State. 


Statistical aggregate 78, 239, canonical 
79. 


Statistics, Bose-Einstein 50. 
Stern-Gerlach effect 65, 75, 205. 
Stieltjes integral 37. 

Stoner’s rule 243. 


Sub-algebra, left-invariant 289, (left- 
and right-) invariant 167, 311, 314. 


Sub-group 116, 334 f., cyclic 1ł7, 
index 118, self-conjugate or invariant 
119, maximal invariant 132. 


Sub-space 2, 32, invariant, under single 
transformation 8, under system of 


transformations 122, equivalent or 
similar 135, 283, see also Invariant 
sub-space. 


Substitution 111, see Correspondence. 


Sum, see Addition ; —s. rule for influence 
of magnetic field, 209. 


Superposition principle 49. 


Symmetric permutation group, see Per- 
mutation group, symmetric. 


Symmetric transformation in tensor space 
282, special 284, Hermitian 283, 
unitary 285, enveloping algebra 284, 
for arbitrary permutation group 332. 


Symmetrization 358. 


Symmetry class of tensors 287, 296, 
primitive 358, of spectral terms 321, 
multiplicity 321, 350 ff., 367. 


Symmetry operator 286, “Young’s 359. 


Symmetry pattern 358 ff., dual on trans- 
posed 361, 368, generated by Young 
symmetry operator 359 ff. 


System space for translation 54, 74, 195, 
for spin 195, total 185, 196, 347 f., 
for equivalent individuals 186, 206 ff., 
347 ;— reduction with respect to 
energy 80, moment of momentum 
188, 206, with regard to symmetric 
permutation group 283 ff, 320 f., 
with regard to Pauli exclusion prin- 
ciple 242 ff., 281 f., 347 ff. 
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class of t. 287, 338, 358, double to 
347 ;— t. space 125, 281 ff., symmetric 
transformation in t. space 282, in- 
variant sub-space 296, reduction 301 ; 
— energy-momentum t. 218. 


Term 44, as energy level or character- 
istic number 46, 56, 80, see also under 
Spectrum, Separation ; — t. order, 
normal 206. 


Thomson, G. P. 50. 


Total quantum number, see 'under 


Quantum number. 


Trace, of matrix or correspondence 11, 
150, of element of algebra 299, 317. 


Transformation, linear 4 = Correspond- 
ence, linear ; — contragredient 12, unit- 
ary 16, principal 128, symmetric in 
tensor space , for arbitrary per- 
mutation group 332, special sym- 
metric 284, canonical 96, in 
quantum mechanics 98 ;—t. toprincipal 
axes 21 ff., 37;— t. group 111, for 
special groups, see under qualifying 
adjective. 


Transition probability 83, 89, in radia- 
tion field 106 ff. 


Translation, left- 116, right- 116. 
Translation, electron 195. 


True quantum number, see under 


Quantum number. 


Uhlenbeck 203. 
Uncertainty: principle 77, 
393- 


Unimodular linear transformation, group 
128. 


derivation 


Unit, element of group 114, of field 302, 
of algebra (modulus or principal unit) 
304, basal 168, 304, idempotent 

enerating 168, 281, independent 
502, primitive 298, real 295 ; — u. 
Hermitian form 15. 


Unitary correspondence, transformation, 
matrix 16 ff., characteristic numbers 
26, infinitesimal 28, u. geometry 
15 ff., u.t.as canonical t. of quantum 
mechanics 98, u. representation of 
group 137 ff. 


INDEX 
Tensor 125 ff, 139, 281, symmetry | Unitary group, in 2-space 137 ff., 


its 
unitary representations Gy 137, com- 
pleteness 137, 163, 389, character- 


istics 151, 163, connection with ro- 
tation group bs 144, augmented 146; 
—ina-space 139 Ñ., reduction of (u) 
and algebra of symmetric transforma- 
tions 285, characteristics 331, 381, 
completeness 381. 


Unitary-orthogonal system of vectors 
or functions 19, 33: completeness 33, 
on group manifold 158. 


Valence 342, 369, v. electron 86, 243 


Vector, v. space, v. geometry 1 ff., in 
Hilbert space 31 ff., v. field 20, co- 
variant and contravariant 13, absolute 
magnitude 16, dual 17, scalar pro- 
duct 16, unitary-orthogonal v. or 
system 16, 19, as element of Abelian 
group 134; —3-v. operator in quantum. 
mechanics 197, selection and intensity 
rules 198 ff., complete system of 
orthogonal v. in 3-space 257, Vv. 
potential of electro-magnetic field 98. 


Vector model of atom, Hund’s 191. 
Velocity, phase and group 53. 


Volume, measure of, on manifold of 
closed continuous group 160, for 
unitary group 386, for unitary uni- 
modular group 162, 389. 


Wave equation, de Brogiie’s 53; 
Schrédinger’s 54 f., 10%, Dirac’s 213, 
218, 225. 


Wave field, Heisenberg-Pauli quantiza- 
tion of 253 ff. 


Wave length 53. 
Wedderburn’s theorem 313. 
Wentzel 74. 

Wien 41. 

Wigner 280, 320. 
Wintner 39. 


Young, A. 358. 
Young’s symmetry operator 359, 


Zeeman effect, normal 85, ror, 193, 198, 
anomalous 198, 204, » 223, for 
doublets 204, for multiplets in gene- 
ral 208 ff. 


